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POLYNUCLEOTIDES ENCODING PROMYOSTATIN POLYPEPTIDES 

This application is a continuation of U.S. Serial No. 09/628,112, filed 
July 27, 2000, which is a continuation-in-part (CIP) of U.S. Serial No. 09/124,180, 
5 filed July 28, 1998; which is a CIP of U.S. Serial No. 09/019,070, filed 

February 5, 1998, which is a CIP of U.S. Serial No. 08/862,445, filed May 23, 1997 
(now abandoned), which is a CIP of U.S. Serial No. 08/847,910, filed April 28, 1997 
(now abandoned), which is a CIP of U.S. Serial No. 08/795,071, filed February 5, 
1997 (now U.S. Patent No. 5,994,618, which issued November 11, 1999); which is a 
10 CIP of U.S. Serial No. 08/525,596, filed October 26, 1995 (now U.S. Patent 

No. 5,827,733, which issued October 27, 1998), the entire contents of each of which 
is incorporated herein by reference. 

This invention was made in part with government support under Grant 
15 No. R01HD35887 awarded by the National Institutes of Health. The government has 
certain rights in this invention. 

BACKGROUND OF THE INVENTION 

FIELD OF THE INVENTION 

20 The invention relates generally to peptide portions of promyostatin (growth 

differentiation factor-8; GDF-8) polypeptides, and more specifically to compositions 
that affect myostatin signal transduction in a cell, and to methods of using such 
compositions to modulate myostatin signal transduction in a cell. 

25 BACKGROUND INFORMATION 

The amount of time, effort and money spent in the United States each year by 
individuals intent on losing weight is staggering. For many of these individuals, the 
goal is not merely to look better, but more importantly to avoid the inevitable medical 
problems associated with being overweight. 

30 
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Greater than half of the adult population in the United States is considered to 
be overweight. Furthermore, twenty to thirty percent of adult men and thirty to forty 
percent of adult women in the United States are considered obese, with the highest 
rates occurring among the poor and minorities. Obesity, which is defined a being at 
5 least about twenty percent above the mean level of adiposity, has dramatically 

increased in prevalence over the past few decades and is becoming a major problem 
among the pediatric population. Twenty percent of all children are now considered 
overweight, a number that represents a doubling over the past five years. 

10 Obesity and the medical problems directly attributable to it are a major cause 

of morbidity and mortality throughout the world. Obesity is a major risk factor for the 
development of various pathologic conditions, including atherosclerosis, 
hypertension, heart attack, type II diabetes, gallbladder disease, and certain cancers, 
and contributes to premature death. Heart disease is the leading cause of mortality in 

1 5 the United States, and type II diabetes afflicts over 16 million people in the United 
States and is one of the leading causes of death by disease. 

More than eighty percent of type II diabetes occurs in obese persons. 
Although type II diabetes affects all races, it is particularly prevalent among Native 

20 Americans, African Americans and Hispanics. Significantly, type II diabetes, which 
used to occur almost exclusively in adults over age forty, now occurs in children, with 
reported cases having almost tripled over the last five years. Type II diabetes, also 
called non-insulin dependent diabetes, is characterized by reduced secretion of insulin 
in response to glucose and by resistance of the body to the action of insulin, even 

25 though insulin levels in the circulation generally are normal or elevated. Type II 

diabetes affects the function of a variety of different tissues and organs and can lead 
to vascular disease, renal failure, retinopathy and neuropathy. 

In contrast to the medical problems associated with obesity, the severe weight 
30 loss that commonly occurs in patients with certain chronic diseases also presents a 
challenge to medical intervention. The molecular basis for this weight loss, referred 
to as cachexia, is not well understood. It is clear, however, that cachexia complicates 
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management of such diseases and is associated with a poor prognosis for the patients. 
The effects of cachexia are evident in the wasting syndrome that occurs in cancer and 
AIDS patients. 

5 Although great efforts have been made in attempting to elucidate the 

biological processes involved in regulating body weight, the results have provided 
more fanfare than actual value. For example, the discovery of leptin has been hailed 
as a breakthrough in understanding the molecular basis for fat accumulation in 
humans, and, with it, the promise of a cure for obesity. Studies in animals indicated 
10 that leptin is involved in transmitting internal signals regulating appetite, and 

suggested leptin could be useful for treating humans suffering from obesity. Progress 
in using leptin for treating obesity has been slow, however, and, thus far, leptin has 
not met initial expectations. 

1 5 Treatment of the morbidly obese currently is limited to surgery to remove 

portions of the intestine, thereby reducing the amount of food (and calories) absorbed. 
For the moderately obese, the only "treatment" is eating a healthy diet and exercising 
regularly, a method that has proved modestly successful at best. Thus, a need exists 
to identify the biological factors involved in regulating body weight, including muscle 

20 development and fat accumulation, such that methods for treating disorders such as 
obesity and cachexia can be developed. The present invention satisfies this need and 
provides additional advantages. 

SUMMARY OF THE INVENTION 

25 The present invention relates to a substantially purified peptide portion of a 

promyostatin polypeptide. A promyostatin polypeptide is exemplified herein by a 
mammalian promyostatin having an amino acid sequence such as that of human 
promyostatin (SEQ ID NO: 2); murine promyostatin (SEQ ID NO: 4); rat 
promyostatin (SEQ ID NO: 6); baboon promyostatin (SEQ ID NO: 10); bovine 

30 promyostatin (SEQ ID NO: 12); porcine promyostatin (SEQ ID NO: 14); and ovine 
promyostatin (SEQ ID NO: 16); by an avian promyostatin having an amino acid 
sequence such as that of chicken promyostatin (SEQ ID NO: 8), and turkey 
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promyostatin (SEQ ID NO: 1 8); and by a piscine promyostatin such as a zebrafish 
promyostatin polypeptide having an amino acid sequence as set forth in SEQ ID 
NO: 20. In addition, a promyostatin polypeptide is exemplified by a polypeptide 
comprising the portion of salmon allele 1 (SEQ ID NO: 27) or the portion of salmon 
5 allele 2 (SEQ ID NO: 29) disclosed herein. 

The present invention also relates to a proteolytic fragment of a growth 
differentiation factor (GDF) polypeptide (pro-GDF) or a functional peptide portion 
thereof. Such proteolytic fragments are exemplified herein by proteolytic fragments 

10 of a promyostatin polypeptide and of a pro-GDF- 1 1 polypeptide, and by functional 
peptide portions of such proteolytic fragments. A proteolytic fragment of a pro-GDF 
polypeptide or functional peptide portion thereof is characterized, in part, by having 
or affecting an activity associated with GDF signal transduction. As such, a 
promyostatin polypeptide or functional peptide portion thereof is can have, for 

1 5 example, myostatin receptor binding activity, myostatin signal transduction 

stimulatory or inhibitory activity, myostatin binding activity, or promyostatin binding 
activity. In one embodiment, the proteolytic fragment is a fragment produced by 
cleavage of a promyostatin polypeptide at a proteolytic cleavage recognition site 
having the amino acid sequence Arg-Xaa-Xaa-Arg (SEQ ID NO: 21). Such 

20 proteolytic recognition sites are exemplified by the Arg-Ser-Arg-Arg (SEQ ID 
NO: 22) sequence shown as amino acid residues 263 to 266 in SEQ ID NO: 1 
(promyostatin) or amino acid residues 295 to 298 of SEQ ID NO: 25 (human 
pro-GDF- 1 1), and by the Arg-Ile-Arg-Arg (SEQ ID NO: 23) sequence shown as 
amino acid residues 263 to 266 in SEQ ID NO: 20. 

25 

In another embodiment, the proteolytic fragment is a GDF prodomain, for 
example, a myostatin prodomain, which includes amino acid residues 1 to about 262 
of a promyostatin polypeptide, or a functional peptide portion thereof, or a GDF- 11 
prodomain, which includes amino acid residues 1 to about 295 of a pro-GDF- 11 
30 polypeptide, or a functional peptide portion thereof. A myostatin prodomain is 

exemplified by amino acid residues 1 to about 263 as set forth in SEQ ID NO: 4 and 
SEQ ID NO: 6; as well as by amino acid residues 1 to about 262 as set forth in SEQ 
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ID NO: 2, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 8, SEQ ID NO: 18, SEQ 
ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 20. A functional peptide portion of a 
myostatin prodomain is exemplified by a peptide portion of a myostatin prodomain 
that can interact specifically with myostatin or with promyostatin. A GDF-1 1 
5 prodomain is exemplified by amino acid residues 1 to about 295 of SEQ ID NO: 25, 
and a functional peptide portion of a GDF-1 1 prodomain is exemplified by a peptide 
portion of a GDF-1 1 prodomain that can specifically interact with mature GDF-1 1 or 
a pro-GDF-1 1 polypeptide. Preferably, the functional peptide portion of a GDF 
prodomain reduces or inhibits the ability of the corresponding GDF or a related GDF 
10 to stimulate signal transduction. 

In still another embodiment, the proteolytic fragment of a pro-GDF 
polypeptide is a mature GDF peptide, or a functional peptide portion thereof. Thus, 
the proteolytic fragment can be a mature C-terminal myostatin peptide, which 

1 5 includes about amino acid residues 268 to 374 of a promyostatin polypeptide, or a 

mature C-terminal GDF-1 1 peptide, which includes about amino acid residues 299 to 
407 of a pro-GDF-1 1 polypeptide. A mature myostatin peptide is exemplified by 
amino acid residues about 268 to 375 as set forth in SEQ ID NO: 4 and SEQ ID 
NO: 6; by amino acid residues about 267 to 374 as set forth in SEQ ID NO: 2, SEQ 

20 ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 8, SEQ ID NO: 18, SEQ ID NO: 14, SEQ 
ID NO: 16, and SEQ ID NO: 20; and by the corresponding amino acid residues of 
SEQ ID NO: 27 and SEQ ID NO: 29. A functional peptide portion of a mature 
myostatin is exemplified by a peptide portion of myostatin that has myostatin signal 
transduction stimulatory or inhibitory activity. A mature GDF- 11 peptide is 

25 exemplified by amino acid residues 299 to 407 of SEQ ID NO: 25, and a functional 
peptide portion of a mature GDF- 11 peptide is exemplified by a peptide portion of 
mature GDF-1 1 that has GDF- 11 signal transduction stimulatory or inhibitory activity. 
The activity of the functional peptide portion of a mature GDF peptide can be, for 
example, an ability to interact specifically with its receptor, an ability to reduce or 

30 inhibit the ability of a mature GDF peptide to interact specifically with its receptor, or 
any other activity that stimulates or inhibits GDF signal transduction activity. 
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The present invention further relates to a mutant pro-GDF polypeptide, for 
example, a mutant promyostatin polypeptide or a mutant pro-GDF- 1 1 polypeptide, 
which contains an amino acid mutation that disrupts proteolytic cleavage at a 
proteolytic cleavage site having the amino acid sequence Arg-Xaa-Xaa-Arg (SEQ ID 
5 NO: 21). Thus, a mutant promyostatin or pro-GDF-1 1 polypeptide can have a 
mutation of an Arg residue of SEQ ID NO: 21 , such that the mutant polypeptide 
cannot be cleaved into a prodomain and a mature GDF peptide. Preferably, the 
mutant GDF polypeptide has a dominant negative activity with respect to a 
corresponding or related wild type GDF polypeptide. For example, a mutant 
1 0 promyostatin polypeptide or a mutant pro-GDF- 1 1 polypeptide can exhibit a 
dominant negative activity with respect to myostatin or GDF-1 1, respectively. 

The present invention also relates to a polynucleotide encoding a peptide 
portion of a promyostatin or pro-GDF-1 1 polypeptide. For example, the 

1 5 polynucleotide can encode a myostatin or GDF- 1 1 prodomain, or a functional peptide 
portion thereof. The invention further relates to antibodies that can specifically bind 
to a peptide portion of a promyostatin polypeptide, for example, to a myostatin 
prodomain or a functional peptide portion thereof, as well as to antibodies that can 
specifically bind to a peptide portion of a pro-GDF-1 1 polypeptide. A kit containing a 

20 peptide portion of a promyostatin or pro-GDF-1 1 polypeptide, or a mutant 

promyostatin or pro-GDF- 11 polypeptide, or a polynucleotide encoding a peptide 
portion of a promyostatin or a pro-GDF-1 1 polypeptide, or an antibody that 
specifically binds such a peptide portion, or a combination thereof, also is provided. 

25 In addition, the present invention relates to a method of identifying a 

functional peptide portion of a myostatin prodomain or a GDF-1 1 prodomain that 
interacts specifically with a myostatin peptide, a GDF-1 1 peptide, or both. A method 
of the invention can be performed, for example, by testing a peptide portion of a 
myostatin prodomain or a GDF- 11 prodomain for the ability to interact specifically 

30 with myostatin; and detecting a specific interaction of the peptide portion with the 
myostatin. In one embodiment, a method of the invention is performed using a 
computer system, wherein the ability of a virtual peptide portion of a myostatin 
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prodomain or a GDF-11 prodomain to specifically interact with a virtual myostatin 
peptide is tested. In another embodiment, the method is performed by contacting the 
peptide portion of the prodomain and the myostatin peptide under conditions suitable 
for a prodomain to specifically interact with a myostatin peptide. 

5 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 shows the amino acid sequences of murine promyostatin (SEQ ID 
NO: 4); rat promyostatin (SEQ ID NO: 6); human promyostatin (SEQ ID NO: 2); 
baboon promyostatin (SEQ ID NO: 10); bovine promyostatin (SEQ ID NO: 12); 
1 0 porcine promyostatin (SEQ ID NO: 1 4); ovine promyostatin (SEQ ID NO : 1 6) ; 

chicken promyostatin (SEQ ID NO: 8), turkey promyostatin (SEQ ID NO: 18); and 
zebrafish promyostatin (SEQ ID NO: 20). Amino acids are numbered relative to the 
human promyostatin (SEQ ID NO: 2). Dashed lines indicate gaps introduced to 
maximize homology. Identical residues among sequences are shaded. 

15 

Figure 2 shows the amino acid sequences of murine promyostatin (SEQ ID 
NO: 4) and zebrafish promyostatin (SEQ ID NO: 20), and portions of the amino acid 
sequences of salmon allele 1 promyostatin (SEQ ID NO: 27; "salmon 1") and salmon 
allele 2 promyostatin (SEQ ID NO: 29; "salmon 2"). Amino acid position relative to 
20 human promyostatin is indicated to left of each row (compare Figure 1 ; first amino 
acid of salmonl corresponds to human promyostatin 218; first amino acid of salmon2 
corresponds to human promyostatin 239). Dashed lines indicate gaps introduced to 
maximize homology. Relative amino acid positions, including gaps, is indicated 
along top of each row. Identical residues among sequences are shaded. 

25 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a substantially purified peptide portion of a 
promyostatin polypeptide. Promyostatin, which previously has been referred to as 
growth differentiation factor-8 (GDF-8), comprises an amino terminal prodomain and 
30 a C-terminal mature myostatin peptide (see U.S. Patent No. 5,827,733). Myostatin 
activity is effected by the mature myostatin peptide following its cleavage from 
promyostatin. Thus, promyostatin is a precursor polypeptide that is proteolytically 
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cleaved to produce active myostatin. As disclosed herein, the myostatin prodomain 
can inhibit myostatin activity, GDF-11 activity, or both. 

The present invention also provides a substantially purified peptide portion of 
5 a pro-GDF- 1 1 polypeptide. Pro-GDF- 1 1 , which previously has been referred to 
generally as GDF-11, comprises an amino terminal prodomain and a C-terminal 
mature GDF-11 peptide (see International Publication No. WO 98/35019, which is 
incorporated herein by reference). GDF-1 1 activity is effected by the mature GDF- 1 1 
peptide following its cleavage from pro-GDF-11. Thus, pro-GDF-11, like 
1 0 promyostatin, is a precursor polypeptide that is proteolytically cleaved to produce 
active GDF-11. As disclosed herein, the GDF-11 prodomain can inhibit GDF-11 
activity, myostatin activity, or both. 

Promyostatin and pro-GDF-11 are members of the transforming growth 
1 5 factor- p (TGF- p) superfamily, which consists of multifunctional polypeptides that 
control proliferation, differentiation, and other functions in various cell types. The 
TGF-p superfamily, which encompasses a group of structurally-related proteins that 
affect a wide range of differentiation processes during embryonic development, 
includes, for example, Mullerian inhibiting substance (MIS), which is required for 
20 normal male sex development (Behringer et al, Nature 345:167, 1990), Drosophila 
decapentaplegic (DPP) gene product, which is required for dorsal-ventral axis 
formation and morphogenesis of the imaginal disks (Padgett et al., Nature 325:81 -84, 
1987), the Xenopus Vg-1 gene product, which localizes to the vegetal pole of eggs 
(Weeks et al., Cell 51 :861-867, 1987), the activins (Mason et al., Riochem. Biophys. 
25 Res. Comm. 135:957-964, 1986), which can induce the formation of mesoderm and 
anterior structures in Xenopus embryos (Thomsen et al., Cell 63:485, 1990), and the 
bone morphogenic proteins (BMPs, osteogenin, OP-1), which can induce de novo 
cartilage and bone formation (Sampath et al., J. Biol. Chem. 265:13198, 1990). The 
TGF-p family members can influence a variety of differentiation processes, including 
30 adipogenesis, myogenesis, chondrogenesis, hematopoiesis, and epithelial cell 
differentiation (Massague, Cell 49:437, 1987; Massague, Ann. Rev. Biochem. 
67:753-791, 1998; each of which is incorporated herein by reference). 
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Many of the TGF-|3 family members have regulatory effects (positive or 
negative) on other peptide growth factors. In particular, certain members of the 
TGF-p superfamily have expression patterns or possess activities that relate to the 

5 function of the nervous system. For example, the inhibins and activins are expressed 
in the brain (Meunier et al., Proc. Natl. Ac ad. Sci.. USA 85:247, 1988; Sawchenko et 
al., Nature 334:615, 1988), and activin can function as a nerve cell survival molecule 
(Schubert et al., Nature 344:868, 1990). Another family member, growth 
differentiation factor -1 (GDF-1), is nervous system-specific in its expression pattern 

10 (Lee, Proc. Natl. Acad. Sci.. USA 88:4250, 1991), and other family members such as 
Vgr-1 (Lyons et al., Proc. Natl. Acad. Sci.. USA 86:4554, 1989; Jones et al., 
Development 111:531, 1991), OP-1 (Ozkaynak et al., J. Biol. Chem. 267:25220, 
1992), and BMP-4 (Jones et al, Development 111:531, 1991), are also expressed in 
the nervous system. Because skeletal muscle produces a factor or factors that 

1 5 promote the survival of motor neurons (Brown, Trends Neurosci. 7 : 1 0, 1 984), the 

expression of myostatin (GDF-8) and GDF-11 in muscle suggests that myostatin and 
GDF-1 1 can be trophic factors for neurons. As such, methods for modulating the 
activity of myostatin, GDF-11, or both can be useful for treating neurodegenerative 
diseases such as amyotrophic lateral sclerosis or muscular dystrophy, or for 

20 maintaining cells or tissues in culture prior to transplantation. 

The proteins of the TGF-p family are synthesized as large precursor proteins, 
which subsequently undergo proteolytic cleavage at a cluster of basic residues 
approximately 1 10 to 140 amino acids from the C-terminus, resulting in the formation 

25 of a prodomain peptide and a C-terminal mature peptide. The C-terminal mature 
peptides of the members of this family of proteins are structurally related, and the 
different family members can be classified into distinct subgroups based on the extent 
of their "homology. Although the homologies within particular subgroups range from 
70% to 90% amino acid sequence identity, the homologies between subgroups are 

30 significantly lower, generally ranging from 20% to 50%. In each case, the active 
species appears to be a disulfide-linked dimer of C-terminal peptide fragments. 
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Promyostatin and pro-GDF-11 polypeptides have been identified in 
mammalian, avian and piscine species, and myostatin is active in various other 
species, including vertebrates and invertebrates. During embryonic development and 
in adult animals, myostatin, for example, is expressed specifically by cells in the 
myogenic lineage (McPherron et al, Nature 387:83-90, 1997, which is incorporated 
herein by reference). During early embryogenesis, myostatin is expressed by cells in 
the myotome compartment of developing somites. At later embryonic stages and in 
adult animals, myostatin is expressed widely in skeletal muscle tissue, although the 
levels of expression vary considerably from muscle to muscle. Myostatin expression 
also is detected in adipose tissue, although at lower levels than in muscle. Similarly, 
GDF-1 1 is expressed in skeletal muscle and adipose tissue, as well as in adult thymus, 
spleen and uterus, and also is expressed in brain at various stages of development. 

Promyostatin polypeptides from various species share substantial sequence 
identity, and the amino acid sequences of human, murine, rat and chicken mature 
myostatin C-terminal sequence are 100% identical (see Figure 1). Promyostatin 
polypeptides are exemplified herein (see Figure 1) by human promyostatin (SEQ ID 
NO: 2); murine promyostatin (SEQ ID NO: 4); rat promyostatin (SEQ ID NO: 6); 
baboon promyostatin (SEQ ID NO: 10); bovine promyostatin (SEQ ID NO: 12); 
porcine promyostatin (SEQ ID NO: 14); ovine promyostatin (SEQ ID NO: 16); 
chicken promyostatin (SEQ ID NO: 8), turkey promyostatin (SEQ ID NO: 18); and 
zebrafish promyostatin (SEQ ID NO: 20). Promyostatin polypeptides also are 
exemplified herein by a polypeptide comprising the portions of salmon allele 1 (SEQ 
ID NO: 27; "salmonl") and of salmon allele 2 (SEQ ID NO: 29; "salmon2"; see 
Figure 2). Nucleic acid molecules encoding these promyostatin polypeptides are 
disclosed herein as SEQ IDNOS: 1,3,5, 9, 11, 13, 15,7, 17, 19, 26 and 28, 
respectively (see, also, McPherron and Lee, Proc. Natl. Acad. Sci.. USA 94:12457, 
1997, which is incorporated herein by reference). A pro-GDF-11 polypeptide is 
exemplified herein by human pro-GDF-1 1 (SEQ ID NO: 25), which is encoded by 
SEQ ID NO: 24. 
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In view of the extensive conservation among promyostatin polypeptides, 
particularly among species as diverse as humans and fish, it would be a routine matter 
to obtain polynucleotides encoding myostatin from any species, including the 
remainders of the salmonl and salmon2 sequences, and to identify promyostatin or 
myostatin expression in any species. In particular, the mature myostatin sequence 
shares significant homology to other members of the TGF-P superfamily, and 
myostatin contains most of the residues that are highly conserved among the other 
family members and in other species. Furthermore, myostatin, like the TGF-Ps and 
inhibin ps, contains an extra pair of cysteine residues in addition to the seven cysteine 
residues present in virtually all other family members. Myostatin is most homologous 
to Vgr-1 (45% sequence identity). Like other members of the TGF-P superfamily, 
myostatin is synthesized as a larger precursor promyostatin polypeptide that is 
proteolytic cleaved into an active myostatin peptide. 

Polynucleotides encoding promyostatin polypeptides of various organisms can 
be identified using well known procedures and algorithms based on identity (or 
homology) to the disclosed sequences. Homology or identity is often measured using 
sequence analysis software such as the Sequence Analysis Software Package of the 
Genetics Computer Group (University of Wisconsin Biotechnology Center, 1710 
University Avenue, Madison, WI 53705). Such software matches similar sequences by 
assigning degrees of homology to various deletions, substitutions and other 
modifications. The terms "homology" and "identity," when used herein in the context of 
two or more nucleic acids or polypeptide sequences, refer to two or more sequences or 
subsequences that are the same or have a specified percentage of amino acid residues or 
of nucleotides that are the same when compared and aligned for maximum 
correspondence over a comparison window or designated region as measured using any 
number of sequence comparison algorithms or by manual alignment and visual 
inspection. 

For sequence comparison, typically one sequence acts as a reference sequence, to 
which test sequences are compared. When using a sequence comparison algorithm, test 
and reference sequences are entered into a computer, subsequence coordinates are 
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designated, if necessary, and sequence algorithm program parameters are designated. 
Default program parameters can be used, or alternative parameters can be designated. 
The sequence comparison algorithm then calculates the percent sequence identities for 
the test sequences relative to the reference sequence, based on the program parameters. 

5 

The term "comparison window" is used broadly herein to include reference to a 
segment of any one of the number of contiguous positions, for example, about 20 to 600 
positions, for example, arnino acid or nucleotide position, usually about 50 to about 200 
positions, more usually about 100 to about 150 positions, in which a sequence may be 

1 0 compared to a reference sequence of the same number of contiguous positions after the 
two sequences are optimally aligned. Methods of alignment of sequence for comparison 
are well-known in the art. Optimal alignment of sequences for comparison can be 
conducted, for example, by the local homology algorithm of Smith and Waterman (Adv. 
A ppl. Math. 2:482, 1981), by the homology alignment algorithm of Needleman and 

1 5 Wunsch ( J. Mol. Biol. 48:443, 1970), by the search for similarity method of Person and 
Lipman (Proc. Natl. Acad. Sci.. USA 85:2444, 1988), each of which is incorporated 
herein by reference; by computerized implementations of these algorithms (GAP, 
BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI); or by manual alignment and 

20 visual inspection. Other algorithms for determining homology or identity include, for 
example, in addition to a BLAST program (Basic Local Alignment Search Tool at the 
National Center for Biological Information), ALIGN, AMAS (Analysis of Multiply 
Aligned Sequences), AMPS (Protein Multiple Sequence Alignment), ASSET 
(Aligned Segment Statistical Evaluation Tool), BANDS, BESTSCOR, BIOSCAN 

25 (Biological Sequence Comparative Analysis Node), BLIMPS (BLocks IMProved 
Searcher), FASTA, Intervals & Points, BMB, CLUSTAL V, CLUSTAL W, 
CONSENSUS, LCONSENSUS, WCONSENSUS, Smith- Waterman algorithm, 
DARWIN, Las Vegas algorithm, FNAT (Forced Nucleotide Alignment Tool), 
Framealign, Framesearch, DYNAMIC, FILTER, FSAP (Fristensky Sequence 

30 Analysis Package), GAP (Global Alignment Program), GENAL, GIBBS, GenQuest, 
ISSC (Sensitive Sequence Comparison), LALIGN (Local Sequence Alignment), LCP 
(Local Content Program), MACAW (Multiple Alignment Construction & Analysis 
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Workbench), MAP (Multiple Alignment Program), MBLKP, MBLKN, PIMA 
(Pattern-Induced Multi-sequence Alignment), SAGA (Sequence Alignment by 
Genetic Algorithm) and WHAT-IF. Such alignment programs can also be used to 
screen genome databases to identify polynucleotide sequences having substantially 
identical sequences. 

A number of genome databases are available for comparison, including, for 
example, a substantial portion of the human genome is available as part of the Human 
Genome Sequencing Project (J. Roach, http://weber.u. Washington.edu/~roach/human_ 
genome_ progress 2.html). In addition, at least twenty-one genomes have been 
sequenced in their entirety, including, for example, M. genitalium, M. jannaschii, 
H. influenzae, E. coli, yeast (S. cerevisiae), and D. melanogaster. Significant progress 
has also been made in sequencing the genomes of model organism such as mouse, 
C. elegans, and Arabadopsis sp. Several databases containing genomic information 
annotated with some functional information are maintained by different organizations, 
and are accessible via the internet, for example, http://wwwtigr.org/tdb; 
http://www.genetics.wisc.edu; http://genome-www. stanford.edu/~ball; http ://hiv- 
web.lanl.gov; http://www.ncbi.nlm.nih.gov; http://www.ebi.ac.uk; 
http://Pasteur.fr/other/biology; and http:// www.genome.wi.mit.edu . 

One example of a useful algorithm is BLAST and BLAST 2.0 algorithms, which 
are described by Altschul et al. ( Nucleic Acids Res. 25:3389-3402, 1977; J. Mol. Biol. 
215:403-410, 1990, each of which is incorporated herein by reference). Software for 
performing BLAST analyses is publicly available through the National Center for 
Biotechnology Information (http://www.ncbi.nlm.nih.gov). This algorithm involves first 
identifying high scoring sequence pairs (HSPs) by identifying short words of length W 
in the query sequence, which either match or satisfy some positive-valued threshold 
score T when aligned with a word of the same length in a database sequence. T is 
referred to as the neighborhood word score threshold (Altschul et al, supra, 1977, 1990). 
These initial neighborhood word hits act as seeds for initiating searches to find longer 
HSPs containing them. The word hits are extended in both directions along each 
sequence for as far as the cumulative alignment score can be increased. Cumulative 
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scores are calculated using, for nucleotide sequences, the parameters M (reward score for 
a pair of matching residues; always >0). For amino acid sequences, a scoring matrix is 
used to calculate the cumulative score. Extension of the word hits in each direction are 
halted when: the cumulative alignment score falls off by the quantity X from its 
5 maximum achieved value; the cumulative score goes to zero or below, due to the 

accumulation of one or more negative-scoring residue alignments; or the end of either 
sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
sensitivity and speed of the alignment. The BLASTN program (for nucleotide 
sequences) uses as defaults a wordlength (W) of 1 1 , an expectation (E) of 10, M=5, N=4 
1 0 and a comparison of both strands. For amino acid sequences, the BLASTP program uses 
as defaults a wordlength of 3, and expectations (E) of 1 0, and the BLOSUM62 scoring 
matrix (see Henikoff and Henikoff, Proc. Natl. Acad. Sci.. USA 89:10915, 1989) 
alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a comparison of both 
strands. 

15 

The BLAST algorithm also performs a statistical analysis of the similarity 
between two sequences (see, for example, Karlin and Altschul, Proc. Natl. Acad. Sci.. 
USA 90:5873, 1993, which is incorporated herein by reference). One measure of 
similarity provided by BLAST algorithm is the smallest sum probability (P(N)), which 
20 provides an indication of the probability by which a match between two nucleotide or 

amino acid sequences would occur by chance. For example, a nucleic acid is considered 
similar to a references sequence if the smallest sum probability in a comparison of the 
test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably less 
than about 0.01 , and most preferably less than about 0.001 . 

25 

In one embodiment, protein and nucleic acid sequence homologies are 
evaluated using the Basic Local Alignment Search Tool ("BLAST"). In particular, 
five specific BLAST programs are used to perform the following task: 

(1) BLASTP and BLAST3 compare an amino acid query sequence against a 
30 protein sequence database; 

(2) BLASTN compares a nucleotide query sequence against a nucleotide 
sequence database; 
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(3) BLASTX compares the six-frame conceptual translation products of a 
query nucleotide sequence (both strands) against a protein sequence database; 

(4) TBLASTN compares a query protein sequence against a nucleotide 
sequence database translated in all six reading frames (both strands); and 

5 (5) TBLASTX compares the six-frame translations of a nucleotide query 

sequence against the six-frame translations of a nucleotide sequence database. 

The BLAST programs identify homologous sequences by identifying similar 
segments, which are referred to herein as "high-scoring segment pairs," between a 

10 query amino or nucleic acid sequence and a test sequence which is preferably 

obtained from a protein or nucleic acid sequence database. High-scoring segment 
pairs are preferably identified (i.e., aligned) by means of a scoring matrix, many of 
which are known in the art. Preferably, the scoring matrix used is the BLOSUM62 
matrix (Gonnet et al., Science 256:1443-1445, 1992; Henikoff and Henikoff, Proteins 

15 17:49-61, 1993, each of which is incorporated herein by reference). Less preferably, 
the PAM or PAM250 matrices may also be used (Schwartz and Dayhoff, eds., 
"Matrices for Detecting Distance Relationships: Atlas of Protein Sequence and 
Structure" (Washington, National Biomedical Research Foundation 1978)). BLAST 
programs are accessible through the U.S. National Library of Medicine, for example, 

20 at www.ncbi.nlm.nih.gov . 

The parameters used with the above algorithms may be adapted depending on 
the sequence length and degree of homology studied. In some embodiments, the 
parameters may be the default parameters used by the algorithms in the absence of 
25 instructions from the user. 



A polynucleotide encoding a promyostatin can be derived from any organism, 
including, for example, mouse, rat, cow, pig, human, chicken, turkey, zebrafish, 
salmon, finfish, other aquatic organisms and other species. Examples of aquatic 
30 organisms include those belonging to the class Piscina, such as trout, char, ayu, carp, 
crucian carp, goldfish, roach, whitebait, eel, conger eel, sardine, flying fish, sea bass, 
sea bream, parrot bass, snapper, mackerel, horse mackerel, tuna, bonito, yellowtail, 
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rockfish, fluke, sole, flounder, blowfish, filefish; those belonging to the class 
Cephalopoda, such as squid, cuttlefish, octopus; those belonging to the class 
Pelecypoda, such as clams (e.g., hardshell, Manila, Quahog, Surf, Soft-shell); cockles, 
mussels, periwinkles; scallops (e.g., sea, bay, calloo); conch, snails, sea cucumbers; 
ark shell; oysters (e.g., C. virginica, Gulf, New Zealand, Pacific); those belonging to 
the class Gastropoda such as turban shell, abalone (e.g. green, pink, red); and those 
belonging to the class Crustacea such as lobster, including but not limited to Spiny, 
Rock, and American; prawn; shrimp, including but not limited to M. rosenbergii, 
P. styllrolls, P. indicus, P.jeponious, P. monodon, P. vannemel, M. ensis, 
S. melantho, N. norvegious, cold water shrimp; crab, including, but not limited to, 
Blue, rook, stone, king, queen, snow, brown, dungeness, Jonah, Mangrove, soft- 
shelled; squilla, krill, langostinos; crayfish/crawfish, including, but not limited, to 
Blue, Marron, Red Claw, Red Swamp, Soft-shelled, white; Annelida; Chordata, 
including, but not limited to, reptiles such as alligators and turtles; Amphibia, 
including frogs; and Echinodermata, including, but not limited to, sea urchins. 

The present invention provides substantially purified peptide portions of a 
promyostatin polypeptide and substantially purified peptide portions of a pro-GDF-11 
polypeptide. As used herein, reference to a "pro-GDF," for example, promyostatin or 
pro-GDF-11, means the foil length polypeptide, including the amino terminal 
prodomain and the carboxy terminal biologically active GDF peptide. In addition, the 
prodomain includes a signal peptide (leader sequence), which comprises about the 
first 15 to 30 amino acids at the amino terminus of the prodomain. The signal peptide 
can be cleaved from the full length pro-GDF polypeptide, which can be further 
cleaved at an Arg-Xaa-Xaa-Arg (SEQ ID NO: 21) proteolytic cleavage site. 

Reference herein to amino acid residues is made with respect to the full length 
pro-GDF polypeptides as shown in Figures 1 and 2 (see, also, Sequence Listing). It 
should also be recognized that reference is made herein to particular peptides 
beginning or ending at "about" a particular amino acid residue. The term "about" is 
used in this context because it is recognized that a particular protease can cleave a 
pro-GDF polypeptide at or immediately adjacent to a proteolytic cleavage recognition 
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site, or one or a few amino acids from the recognition site. As such, reference, for 
example, to a myostatin prodomain having a sequence of about amino acid residues 1 
to 263 of SEQ ID NO: 4 would include an amino terminal peptide portion of 
promyostatin that includes the signal peptide and has a carboxy terminus ending at 
amino acid residue 257 to amino acid residue 269, preferably at amino acid 
residue 260 to amino acid residue 266. 

Similarly, the signal peptide can be cleaved at any position from about amino 
acid residue 15 to 30 of a pro-GDF polypeptide, for example, at residue 15, 20, 25 
or 30, without affecting the function, for example, of a remaining prodomain. Thus, 
for convenience, reference is made generally herein to a peptide portion of a pro-GDF 
polypeptide, from which the signal peptide has been cleaved, as beginning at about 
amino acid residue 20. However, it will be recognized that cleavage of the signal 
peptide can be at any amino acid position within about the first 1 5 to 30 amino 
terminal amino acids of a pro-GDF polypeptide. As such, reference, for example, to a 
myostatin prodomain having a sequence of about amino acid residues 20 to 263 of 
SEQ ID NO: 4 would include a peptide portion of promyostatin that lacks about the 
first 15 to 30 amino acids of promyostatin, comprising the signal peptide, and that has 
a carboxy terminus ending at amino acid residue 257 to amino acid residue 269, 
preferably at amino acid residue 260 to amino acid residue 266. 

In general, reference is made herein to a pro-GDF polypeptide or a GDF 
prodomain as beginning at about amino acid 1. In view of the above disclosure, 
however, it will be recognized that such pro-GDF polypeptides or GDF prodomains 
that lack the signal peptide also are encompassed within the present invention. 
Further in this respect, it should be recognized that the presence or absence of a signal 
peptide in a peptide of the invention can influence, for example, the compartments of 
a cell through which a peptide, for example, a myostatin prodomain will traverse and 
to which the peptide ultimately will localize, including whether the peptide will be 
secreted from the cell. Thus, the present invention further provides a substantially 
purified signal peptide portion of a pro-GDF polypeptide. As disclosed herein, such a 
signal peptide can be used to target an agent, particularly a peptide agent, to the same 
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cellular compartments as the naturally occurring GDF from which the signal peptide 
is derived. 

The term "peptide" or "peptide portion" is used broadly herein to mean two or 
5 more amino acids linked by a peptide bond. The term "fragment" or "proteolytic 
fragment" also is used herein to refer to a product that can be produced by a 
proteolytic reaction on a polypeptide, i.e., a peptide produced upon cleavage of a 
peptide bond in the polypeptide. Although the term "proteolytic fragment" is used 
generally herein to refer to a peptide that can be produced by a proteolytic reaction, it 
1 0 should be recognized that the fragment need not necessarily be produced by a 

proteolytic reaction, but also can be produced using methods of chemical synthesis or 
methods of recombinant DN A technology, as discussed in greater detail below, to 
produce a synthetic peptide that is equivalent to a proteolytic fragment. In view of the 
disclosed homology of promyo statin with other members of the TGF-p superfamily, it 
1 5 will be recognized that a peptide of the invention is characterized, in part, in that it is 
not present in previously disclosed members of this superfamily. Whether a peptide 
portion of a promyostatin or pro-GDF-1 1 polypeptide is present in a previously 
disclosed member of the TGF-p superfamily readily can be determined using the 
computer algorithms described above. 

20 

Generally, a peptide of the invention contains at least about six amino acids, 
usually .contains about ten amino acids, and can contain fifteen or more amino acids, 
particularly twenty or more amino acids. It should be recognized that the term 
"peptide" is not used herein to suggest a particular size or number of amino acids 
25 comprising the molecule, and that a peptide of the invention can contain up to several 
amino acid residues or more. For example, a full length mature C-terminal myostatin 
peptide contains more than 100 amino acids and a full length prodomain peptide can 
contain more than 260 amino acids. 

30 As used herein, the term "substantially purified" or "substantially pure" or 

"isolated" means that the molecule being referred to, for example, a peptide or a 
polynucleotide, is in a form that is relatively free of proteins, nucleic acids, lipids, 
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carbohydrates or other materials with which it is naturally associated. Generally, a 
substantially pure peptide, polynucleotide, or other molecule constitutes at least 
twenty percent of a sample, generally constitutes at least about fifty percent of a 
sample, usually constitutes at least about eighty percent of a sample, and particularly 
constitutes about ninety percent or ninety-five percent or more of a sample. A 
determination that a peptide or a polynucleotide of the invention is substantially pure 
can be made using well known methods, for example, by performing electrophoresis 
and identifying the particular molecule as a relatively discrete band. A substantially 
pure polynucleotide, for example, can be obtained by cloning the polynucleotide, or 
by chemical or enzymatic synthesis. A substantially pure peptide can be obtained, for 
example, by a method of chemical synthesis, or using methods of protein purification, 
followed by proteolysis and, if desired, further purification by chromatographic or 
electrophoretic methods. 

A peptide of the invention can be identified by comparison to a promyostatin 
or pro-GDF-11 sequence and determining that the amino acid sequence of the peptide 
is contained within the promyostatin or pro-GDF-11 polypeptide sequence, 
respectively. It should be recognized, however, that a peptide of the invention need 
not be identical to a corresponding amino acid sequence of promyostatin or 
pro-GDF- 1 1 . Thus, a peptide of the invention can correspond to an amino acid 
sequence of a promyostatin polypeptide, for example, but can vary from a naturally 
occurring sequence, for example, by containing one or more D-amino acids in place 
of a corresponding L-amino acid; or by containing one or more amino acid analogs, 
for example, an amino acid that has been derivatized or otherwise modified at its 
reactive side chain. Similarly, one or more peptide bonds in the peptide can be 
modified. In addition, a reactive group at the amino terminus or the carboxy terminus 
or both can be modified. Such peptides can be modified, for example, to have 
improved stability to a protease, an oxidizing agent or other reactive material the 
peptide may encounter in a biological environment, and, therefore, can be particularly 
useful in performing a method of the invention. Of course, the peptides can be 
modified to have decreased stability in a biological environment such that the period 
of time the peptide is active in the environment is reduced. 
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The sequence of a peptide of the invention also can be modified in comparison 
to the corresponding sequence in a promyostatin or pro-GDF-1 1 polypeptide by 
incorporating a conservative amino acid substitution for one or a few amino acids in 
5 the peptide. Conservative amino acid substitutions include the replacement of one 
amino acid residue with another amino acid residue having relatively the same 
chemical characteristics, for example, the substitution of one hydrophobic residue 
such as isoleucine, valine, leucine or methionine for another, or the substitution of one 
polar residue for another, for example, substitution of arginine for lysine; or of 
10 glutamic for aspartic acid; or of glutamine for asparagine; or the like. Examples of 
positions of a promyostatin polypeptide that can be modified are evident from 
examination of Figure 1, which shows various amino acid differences in the myostatin 
prodomain and mature myostatin peptide that do not substantially affect promyostatin 
or myostatin activity. 

15 

The present invention also provides a substantially purified proteolytic 
fragment of a growth differentiation factor (GDF) polypeptide (a pro-GDF 
polypeptide) or a functional peptide portion thereof. Proteolytic fragments of a pro- 
GDF polypeptide are exemplified herein by proteolytic fragments of a promyostatin 
20 polypeptide and proteolytic fragments of a pro-GDF-1 1 polypeptide. As disclosed 
herein, a peptide portion of a pro-GDF polypeptide that is equivalent to a proteolytic 
fragment of a pro-GDF can be produced by a chemical method or a recombinant DNA 
method. In view of the present disclosure, proteolytic fragments of other GDF 
polypeptides readily can be made and used. 

25 

In general, peptides corresponding to proteolytic fragments of a pro-GDF 
polypeptide are exemplified by a carboxy terminal (C-terminal) mature GDF 
fragment, which can interact specifically with a GDF receptor and affect GDF signal 
transduction, and an amino terminal prodomain fragment, which can include a signal 
30 peptide and, as disclosed herein, can interact specifically with a pro-GDF polypeptide 
or mature GDF peptide and affect its ability to effect GDF signal transduction. For 
example, proteolytic fragments of a promyostatin polypeptide include a C-terminal 
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mature myostatin peptide, which can interact specifically with a myostatin receptor 
and induce myostatin signal transduction; and an amino terminal prodomain fragment, 
which can interact specifically with myostatin, thereby reducing or inhibiting the 
ability of myostatin to induce myostatin signal transduction. 

A proteolytic fragment of a pro-GDF polypeptide, or a functional peptide 
portion thereof, is characterized, in part, by having or affecting an activity associated 
with the stimulation or inhibition of GDF signal transduction. For example, a 
promyostatin polypeptide or functional peptide portion thereof can have myostatin 
receptor binding activity, myostatin signal transduction stimulatory or inhibitory 
activity, myostatin binding activity, promyostatin binding activity, or a combination 
thereof. Thus, the term "functional peptide portion," when used herein in reference to 
a pro-GDF polypeptide, means a peptide portion of the pro-GDF polypeptide that can 
interact specifically with its receptor and stimulate or inhibit GDF signal transduction; 
that can interact specifically with a mature GDF or a pro-GDF; or that exhibits 
cellular localization activity, i.e., the activity of a signal peptide. It should be 
recognized that a functional peptide portion of full length mature myostatin peptide, 
for example, need not have the same activity of the mature myostatin, including the 
ability to stimulate myostatin signal transduction, since functional peptide portions of 
the mature peptide can have, for example, an ability to specifically interact with a 
myostatin receptor without also having the ability to activate the signal transduction 
pathway. Methods for identifying such a functional peptide portion of a pro-GDF 
polypeptide, which can be useful as a myostatin antagonist, are disclosed herein or 
otherwise known in the art. Thus, in one embodiment, a functional peptide portion of 
a promyostatin polypeptide can interact specifically with a myostatin receptor, and 
can act as an agonist to stimulate myostatin signal transduction or as an antagonist to 
reduce or inhibit myostatin signal transduction. 

In another embodiment, a functional peptide portion of a promyostatin 
polypeptide can interact specifically with a promyostatin polypeptide or with a mature 
myostatin peptide, thereby blocking myostatin signal transduction. Such a functional 
peptide portion of promyostatin can act, for example, by preventing cleavage of a 
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promyostatin polypeptide to mature myostatin; by forming a complex with a mature 
myostatin peptide; or by some other mechanism. Where a peptide-myostatin complex 
is formed, the complex can block myostatin signal transduction, for example, by 
reducing or inhibiting the ability of the myostatin to interact specifically with its 
5 receptor, or by binding to the receptor in the form that lacks the ability to induce 
myostatin signal transduction. 

Proteolytic fragments of a pro-GDF polypeptide can be produced by cleavage 
of the polypeptide at a proteolytic cleavage site having a consensus amino acid 

10 sequence Arg-Xaa-Xaa-Arg (SEQ ID NO: 21). Such proteolytic recognition sites are 
exemplified by the Arg-Ser-Arg-Arg (SEQ ID NO: 22) sequence shown as amino acid 
residues 263 to 266 in SEQ ID NO: 1 (promyostatin) or amino acid residues 295 to 
298 of SEQ ID NO: 25 (human pro-GDF-11; see, also, relative positions 267 to 270 
of Figure 2), and by the Arg-Ile-Arg-Arg (SEQ ID NO: 23) sequence shown as amino 

1 5 acid residues 263 to 266 in SEQ ID NO: 20. 

In addition to the proteolytic cleavage site for the signal peptide, promyostatin 
polypeptides, for example, contain two additional potential proteolytic processing 
sites (Lys-Arg and Arg-Arg). Cleavage of a promyostatin polypeptide at or near the 

20 latter proteolytic processing site, which is contained within the consensus 

Arg-Xaa-Xaa-Arg (SEQ ID NO: 21) proteolytic cleavage recognition site (see, for 
example, amino acid residues 263 to 266 of SEQ ID NO: 2), generates a biologically 
active C-terminal mature human myostatin fragment. The exemplified full length 
mature myostatin peptides contain about 103 to about 109 amino acids and have a 

25 predicted molecular weight of approximately 12,400 daltons (Da). In addition, 

myostatin can form dimers, which have an expected molecular weight of about 23 to 
30 kiloDaltons (kDa). The dimers can be myostatin homodimers or can be 
heterodimers, for example, with GDF-1 1 or another GDF or TGF-P family member. 

30 A proteolytic fragment of the invention is exemplified by a GDF prodomain, 

for example, a myostatin prodomain, which includes about amino acid residues 20 
to 262 of a promyostatin polypeptide, or a functional peptide portion thereof, or a 
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GDF-11 prodomain, which includes about amino acid residues 20 to 295 of a 
pro-GDF-11 polypeptide, or a functional peptide portion thereof, each of which can 
further contain the signal peptide comprising about amino acids 1 to 20 of the 
respective pro-GDF polypeptide. Myostatin prodomains are further exemplified by 
about amino acid residues 20 to 263 as set forth in SEQ ID NO: 4 and SEQ ID NO: 6; 
as well as by about amino acid residues about 20 to 262 as set forth in SEQ ID NO: 2, 
SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 8, SEQ ID NO: 18, SEQ ID NO: 14, 
SEQ ID NO: 16, SEQ ID NO: 20, which can be produced by proteolytic cleavage of a 
corresponding promyostatin polypeptide, can be chemically synthesized, or can be 
expressed from a recombinant polynucleotide encoding the proteolytic fragment. A 
functional peptide portion of a myostatin prodomain is exemplified by a peptide 
portion of a myostatin prodomain that can interact specifically with myostatin or with 
promyostatin. A GDF-11 prodomain is exemplified by about amino acid residues 20 
to 295 of SEQ ID NO: 25, which can further include the signal peptide comprising 
about amino acid residues 1 to 20 of SEQ ID NO: 25, and a functional peptide portion 
of a GDF-11 prodomain is exemplified by a peptide portion of a GDF-11 prodomain 
that can specifically interact with mature GDF-11 or a pro-GDF-11 polypeptide. 
Preferably, the functional peptide portion of a GDF prodomain inhibits the ability of 
the corresponding GDF or a related GDF to stimulate signal transduction, for 
example, by reducing or inhibiting the ability of the GDF to interact specifically with 
its receptor, or by binding to the receptor as an inactive complex. In one embodiment, 
the present invention provides a functional fragment of a pro-GDF polypeptide, 
particularly a functional fragment of a GDF prodomain, operably linked to a GDF 
signal peptide, preferably a myostatin signal peptide or a GDF-11 signal peptide 
comprising about the first 15 to 30 amino terminal amino acids of promyostatin or 
pro-GDF- 1 1 , respectively. 

As disclosed herein, a myostatin prodomain or GDF-1 1 prodomain can 
interact. with mature myostatin, GDF-11, or both, thereby reducing or inhibiting the 
ability of the mature GDF to interact specifically with its receptor (see Examples 7 
and 8). Thus, a functional peptide portion of a myostatin prodomain, for example, can 
be obtained by examining peptide portions of a myostatin prodomain using methods 
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as provided herein, and identifying functional peptide portions of the prodomain that 
can interact specifically with myostatin or with promyostatin and can reduce or inhibit 
the ability of myostatin to interact specifically with a myostatin receptor or to 
stimulate myostatin signal transduction. 

A functional peptide portion of a myostatin prodomain that can specifically 
interact with myostatin, or a functional peptide portion of another GDF prodomain, 
also can be identified using any of various assays known to be useful for identifying 
specific protein-protein interactions. Such assays include, for example, methods of 
gel electrophoresis, affinity chromatography, the two hybrid system of Fields and 
Song (Nature 340:245-246, 1989; see, also, U.S. Patent No. 5,283,173; Fearon et al, 
Proc. Natl. Acad. Sci.. USA 89:7958-7962, 1992; Chien et al., Proc. Natl. Acad. Sci. 
USA 88:9578-9582, 1991; Young. Biol. Reprod. 58:302-311(1998), each of which is 
incorporated herein by reference), the reverse two hybrid assay (Leanna and Hannink, 
Nucl. Acids Res. 24:3341-3347, 1996, which is incorporated herein by reference), the 
repressed transactivator system (U.S. Patent No. 5,885,779, which is incorporated 
herein by reference), the phage display system (Lowman, Ann. Rev. Biophys. 
Biomol. Struct. 26:401-424, 1997, which is incorporated herein by reference), 
GST/HIS pull down assays, mutant operators (WO 98/01879, which is incorporated 
herein by reference), the protein recruitment system (U.S. Patent No. 5,776,689, 
which is incorporated herein by reference), and the like (see, for example, Mathis, 
Clin. Chem. 41:139-147. IQQSTmr. Anticancer Drug Res. 12:145-167. 1997; 
Phizicky et al., Microbiol. Rev. 59:94-123, 1995; each of which is incorporated herein 
by reference). 

A functional peptide portion of a GDF prodomain also can be identified using 
methods of molecular modeling. For example, an amino acid sequence of a mature 
myostatin peptide can be entered into a computer system having appropriate modeling 
software, and a three dimensional representation of the myostatin ("virtual 
myostatin") can be produced. A promyostatin amino acid sequence also can be 
entered into the computer system, such that the modeling software can simulate 
portions of the promyostatin sequence, for example, portions of the prodomain, and 
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can identify those peptide portions of the prodomain that can interact specifically with 
the virtual myostatin. A base line for a specific interaction can be predefined by 
modeling the virtual myostatin and a full length promyostatin prodomain, and 
identifying the amino acid residues in the virtual myostatin that are "contacted" by the 
5 prodomain, since such an interaction is known to inhibit the activity of the myostatin. 

It should be recognized that such methods, including two hybrid assays and 
molecular modeling methods, also can be used to identify other specifically 
interacting molecules encompassed within the present invention. Thus, methods such 

10 as the two hybrid assay can be used to identify a GDF receptor such as a myostatin 
receptor using, for example, a myostatin peptide or a peptide portion thereof that 
specifically interacts with an Act RIIA or Act RIIB receptor as one binding 
component of the assay, and identifying a GDF receptor, which specifically interacts 
with the myostatin peptide. Similarly, methods of molecular modeling can be used to 

1 5 identify an agent that interacts specifically with a mature GDF peptide such as mature 
myostatin, or with a GDF receptor and, therefore, can be useful as an agonist or an 
antagonist of signal transduction mediated by the GDF or the GDF receptor. Such an 
agent can be, for example, a functional peptide portion of a myostatin prodomain or 
GDF-1 1 prodomain, or a chemical agent that mimics the action of the GDF 

20 prodomain. 

Modeling systems useful for the purposes disclosed herein can be based on 
structural information obtained, for example, by crystallographic analysis or nuclear 
magnetic resonance analysis, or on primary sequence information (see, for example, 

25 Dunbrack et al., "Meeting review: the Second meeting on the Critical Assessment of 
Techniques for Protein Structure Prediction (CASP2) (Asilomar, California, 
December 13-16, 1996). Fold Pes. 2(2): R27-42, (1997); Fischer and Eisenberg, 
Protein Sci. 5:947-55, 1996; (see, also, U.S. Patent No. 5,436,850); Havel, Prog. 
Biophvs. Mol. Biol. 56:43-78, 1991; Lichtarge et al, J. Mol. Biol. 274:325-37, 1997; 

30 Matsumoto et al., J. Biol. Chem. 270:19524-31, 1995; Sali et al., J. Biol. Chem. 
268:9023-34, 1993; Sali, Molec. Med. Today 1:270-7, 1995a; Sali, Curr. Opin. 
Biotechnol. 6:437-51, 1995b; Sali et al., Proteins 23: 318-26, 1995c; Sali, Nature 
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Struct. Biol. 5:1029-1032, 1998; U.S. Patent No. 5,933,819; U.S. Patent 
No. 5,265,030, each of which is incorporated herein by reference). 

The crystal structure coordinates of a promyostatin polypeptide or a GDF 
receptor can be used to design compounds that bind to the protein and alter its 
physical or physiological properties in a variety of ways. The structure coordinates of 
the protein can also be used to computationally screen small molecule data bases for 
agents that bind to the polypeptide to develop modulating or binding agents, which 
can act as agonists or antagonists of GDF signal transduction. Such agents can be 
identified by computer fitting kinetic data using standard equations (see, for example, 
Segel, "Enzyme Kinetics" (J. Wiley & Sons 1975), which is incorporated herein by 
reference). 

Methods of using crystal structure data to design inhibitors or binding agents 
are known in the art. For example, GDF receptor coordinates can be superimposed 
onto other available coordinates of similar receptors, including receptors having a 
bound inhibitor, to provide an approximation of the way the inhibitor interacts with 
the receptor. Computer programs employed in the practice of rational drug design 
also can be used to identify compounds that reproduce interaction characteristics 
similar to those found, for example, between a mature myostatin and a co-crystallized 
myostatin prodomain. Detailed knowledge of the nature of the specific interactions 
allows for the modification of compounds to alter or improve solubility, 
pharmacokinetics, and the like, without affecting binding activity. 

Computer programs for carrying out the activities necessary to design agents 
using crystal structure information are well known. Examples of such programs 
include, Catalyst Databases™ - an information retrieval program accessing chemical 
databases such as BioByte Master File, Derwent WDI and ACD; Catalyst/HYPO™ - 
generates models of compounds and hypotheses to explain variations of activity with 
the structure of drug candidates; Ludi™ - fits molecules into the active site of a 
protein by identifying and matching complementary polar and hydrophobic groups; 
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and Leapfrog™ - "grows" new ligands using a genetic algorithm with parameters 
under the control of the user. 

Various general purpose machines can be used with such programs, or it may be 
more convenient to construct more specialized apparatus to perform the operations. 
Generally, the embodiment is implemented in one or more computer programs executing 
on programmable systems each comprising at least one processor, at least one data 
storage system (including volatile and non-volatile memory and/or storage elements), at 
least one input device, and at least one output device. The program is executed on the 
processor to perform the functions described herein. 

Each such program can be implemented in any desired computer language, 
including, for example, machine, assembly, high level procedural, or object oriented 
programming languages, to communicate with a computer system. In any case, the 
language may be a compiled or interpreted language. The computer program will 
typically be stored on a storage media or device, for example, a ROM, CD-ROM, 
magnetic or optical media, or the like, that is readable by a general or special purpose 
programmable computer, for configuring and operating the computer when the storage 
media or device is read by the computer to perform the procedures described herein. 
The system may also be considered to be implemented as a computer-readable storage 
medium, configured with a computer program, where the storage medium so configured 
causes a computer to operate in a specific and predefined manner to perform the 
functions described herein. 

Embodiments of the invention include systems, for example, internet based 
systems, particularly computer systems which store and manipulate coordinate 
information obtained by crystallographic or NMR analysis, or amino acid or nucleotide 
sequence information, as disclosed herein. As used herein, the term "computer system" 
refers to the hardware components, software components, and data storage components 
used to analyze coordinates or sequences as set forth herein. The computer system 
typically includes a processor for processing, accessing and manipulating the sequence 
data. The processor can be any well known type of central processing unit, for example, 
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a Pentium II or Pentium III processor from Intel Corporation, or a similar processor from 
Sun, Motorola, Compaq, Advanced MicroDevices or International Business Machines. 

Typically the computer system is a general purpose system that comprises the 
5 processor and one or more internal data storage components for storing data, and one or 
more data retrieving devices for retrieving the data stored on the data storage 
components. A skilled artisan can readily appreciate that any one of the currently 
available computer systems are suitable. 

10 In one embodiment, the computer system includes a processor connected to a 

bus, which is connected to a main memory, preferably implemented as RAM, and one or 
more internal data storage devices such as a hard drive or other computer readable media 
having data recorded thereon. In some embodiments, the computer system further 
includes one or more data retrieving devices for reading the data stored on the internal 

1 5 data storage devices. 

The data retrieving device may represent, for example, a floppy disk drive, a 
compact disk drive, a magnetic tape drive, or a modem capable of connection to a 
remote data storage system (e.g., via the internet). In some embodiments, the internal 
20 data storage device is a removable computer readable medium such as a floppy disk, a 
compact disk, a magnetic tape, etc. containing control logic and/or data recorded 
thereon. The computer system may advantageously include or be programmed by 
appropriate software for reading the control logic and/or the data from the data storage 
component once inserted in the data retrieving device. 

25 

The computer system generally include a display, which is used to display output 
to a computer user. It should also be noted that the computer system can be linked to 
other computer systems in a network or wide area network to provide centralized access 
to the computer system. 

30 

Where it is desired to identify a chemical entity that interacts specifically with 
myostatin or with a GDF receptor, any of several methods to screen chemical entities or 

Gray Cary\GT\6209497.1 
104659-160348 



29 



fragments for their ability to interact specifically with the molecule can be used. This 
process may begin by visual inspection, for example, of myostatin and a myostatin 
prodomain on the computer screen. Selected peptide portions of the prodomain, or 
chemical entities that can act as mimics, then can be positioned in a variety of 
5 orientations, or docked, within an individual binding site of the myostatin. Docking can 
be accomplished using software such as Quanta and Sybyl, followed by energy 
minimization and molecular dynamics with standard molecular mechanics forcefields, 
such as CHARMM and AMBER. 

1 0 Specialized computer programs can be particularly useful for selecting peptide 

portions of a prodomain, or chemical entities useful, for example, as a GDF receptor 
agonist or antagonist. Such programs include, for example, GRID (Goodford, J. Med. 
Chem. . 28:849-857, 1985; available from Oxford University, Oxford, UK); MCSS 
(Miranker and Karplus, Proteins: Structure. Function and Genetics 1 1 :29-34, 1991, 

1 5 available from Molecular Simulations, Burlington MA); AUTODOCK (Goodsell and 
Olsen, Proteins: Structure. Function, and Genetics 8:195-202, 1990, available from 
Scripps Research Institute, La Jolla CA); DOCK (Kuntz, et al., J. Mol. Biol. 161 :269- 
288, 1982, available from University of California, San Francisco CA), each of which is 
incorporated herein by reference. 

20 

Suitable peptides or agents that have been selected can be assembled into a single 
compound or binding agent. Assembly can be performed by visual inspection of the 
relationship of the fragments to each other on the three-dimensional image displayed on 
a computer screen, followed by manual model building using software such as Quanta or 

25 Sybyl. Useful programs to aid one of skill in the art in connecting the individual 

chemical entities or fragments include, for example, CAVEAT (Bartlett et al, Special 
Pub.. Roval Chem. Soc. 78:182-196, 1989, available from the University of California, 
Berkeley CA); 3D Database systems such as MACCS-3D (MDL Information Systems, 
San Leandro CA; for review, see Martin. J. Med. Chem. 35:2145-2154, 1992); HOOK 

30 (available from Molecular Simulations, Burlington, Mass.), each of which is 
incorporated herein by reference. 
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In addition to the method of building or identifying such specifically interacting 
agents in a step-wise fashion, one fragment or chemical entity at a time as described 
above, the agents can be designed as a whole or de novo using either an empty active site 
or, optionally, including some portions of a known agent that specifically interacts, for 
example, a full length myostatin prodomain, which interacts specifically with myostatin. 
Such methods include, for example, LUDI (Bohm, J. Comp. Aid. Molec. Design 6:61- 
78, 1992, available from Biosym Technologies, San Diego CA); LEGEND (Nishibata 
and Itai, Tetrahedron 47:8985, 1991, available from Molecular Simulations, Burlington 
MA); LeapFrog (available from Tripos Associates, St. Louis MO), and those described 
by Cohen et al. (J. Med. Chem. 33:883-894, 1990) and by Navia and Murcko, Curr. 
Opin. Struct. Biol. 2:202-210, 1992, each of which is incorporated herein by reference). 

Specific computer software is available in the art to evaluate compound 
deformation energy and electrostatic interaction. Examples of programs designed for 
such uses include Gaussian 92, revision C (Frisch, Gaussian, Inc., Pittsburgh PA, 1992); 
AMBER, version 4.0 (Kollman, University of California at San Francisco, 1994); 
QUANTA/CHARMM (Molecular Simulations, Inc., Burlington MA, 1994); and Insight 
II/Discover (Biosysm Technologies Inc., San Diego CA, 1994). These programs may be 
implemented using, for example, a Silicon Graphics workstation, IRIS 4D/35 or IBM 
RISC/6000 workstation model 550. Other hardware systems and software packages will 
be known to those skilled in the art of which the speed and capacity are continually 
modified. 

A molecular modeling process for identifying an agent that interacts specifically 
with a molecule of interest, for example, with a mature GDF peptides such as mature 
myostatin, or with a GDF receptor can be performed as disclosed herein. In a first step, 
a virtual representation of a target molecule, for example, myostatin, is performed. 
Thus, in one embodiment, the present invention provides a virtual representation of a 
target molecule, wherein the target molecule is selected from a pro-GDF polypeptide, for 
example, promyostatin; a peptide portion of a pro-GDF polypeptide; a GDF receptor; 
and a relevant domain of a GDF receptor, for example, a GDF binding domain. The 
virtual representation of the target molecule can be displayed or can be maintained in a 
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computer system memory. The process begins at a start state, comprising the virtual 
target molecule, then moves to a state wherein a database containing one or more virtual 
test molecules stored to a memory in the computer system. As discussed above, the 
memory can be any type of memory, including RAM or an internal storage device. 

5 

The process then moves to a state wherein the ability of a virtual first test 
molecule to specifically interact with the virtual target molecule is determined, wherein 
the database containing the virtual test molecule, which can be one of a population of 
test molecules, is opened for analysis of the an interaction of the virtual target molecule 
1 0 and virtual test molecule, and the analysis is made. A determination of a specific 

interaction can be made based on calculations performed by software maintained in the 
computer system, or by comparison to a predetermined specific interaction, which can 
be stored in a memory in the computer system and accessed as appropriate. 

1 5 The process then moves to a state wherein, where a specific interaction is 

detected, the virtual test molecule is displayed, or is stored in a second database on the 
computer. If appropriate, the process is repeated for the virtual target molecule and a 
second virtual test molecule, a third virtual test molecule, and so on, as desired. 

20 If a determination is made that a virtual test molecule specifically interacts with 

the virtual target molecule, the identified virtual test molecule is moved from the 
database and can be displayed to the user. This state notifies the user that the molecule 
with the displayed name or structure interacts specifically with the target molecule 
within the constraints that were entered. Once the name of the identified test molecule is 

25 displayed to the user, the process moves to a decision state, wherein a determination is 
made whether more virtual test molecules exist in the database or are to be examined. If 
no more molecules exist in the database, then the process terminates at an end state. 
However, if more test molecules exist in the database, then the process moves to a state, 
wherein a pointer is moved to the next test molecule in the database so that it can be 

30 examined for specific binding activity. In this manner, the new molecule is examined 
for the ability to interact specifically with the virtual target molecule. 
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Such methods as described above can be used in various aspects encompassed 
within the claimed invention. Thus, the methods can be used to identify a peptide 
portion of a promyostatin prodomain that can interact specifically with myostatin and 
reduce or inhibit the ability of myostatin to interact with its receptor or otherwise affect 
5 the ability of the myostatin to effect signal transduction. Similarly, the methods can be 
used to identify small organic molecules that mimic the action of a GDF prodomain, 
thereby reducing or inhibiting myostatin or GDF-1 1 signal transduction. The methods 
also can be used to identify agents that interact specifically with a GDF receptor, for 
example, an Act RIIA, Act RIIB or other GDF receptor, such agents being useful as 
1 0 GDF receptor agonists or antagonists, which can modulate GDF signal transduction in a 
cell. In addition, the methods provide a means to identify previously unknown pro-GDF 
polypeptides or GDF receptors, for example, by identifying conserved structural features 
of the particular polypeptides. 

15 Similar to other members of the TGF-p* superfamily, active GDF peptides are 

expressed as precursor polypeptides, which are cleaved to a mature, biologically 
active form. Accordingly, in still another embodiment, the proteolytic fragment of a 
pro-GDF polypeptide is a mature GDF peptide, or a functional peptide portion of a 
mature GDF peptide, where, as discussed above, the functional peptide portion can 

20 have the activity of a GDF agonist or antagonist. The proteolytic fragment can be a 
mature C-terminal myostatin peptide, which includes about amino acid residues 268 
to 374 of a promyostatin polypeptide (see Figure 1; see, also, Figure 2), or a mature 
C-terminal GDF-1 1 peptide, which includes about amino acid residues 299 to 407 of a 
pro-GDF- 11 polypeptide. Full length mature myostatin peptides are exemplified by 

25 amino acid residues about 268 to 375 as set forth in SEQ ID NO: 4 and SEQ ID 

NO: 6; by amino acid residues about 267 to 374 as set forth in SEQ ID NO: 2, SEQ 
ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 8, SEQ ID NO: 18, SEQ ID NO: 14, SEQ 
ID NO: 16, and SEQ ID NO: 20, and by amino acid residues about 49 to 157 of SEQ 
ID NO:-27 and amino acid residues about 28 to 136 of SEQ ID NO: 29. A full length 

30 mature GDF-1 1 peptide is exemplified by amino acid residues about 299 to 407 of 
SEQ ID NO: 25. Functional peptide portions of the mature GDF peptides are 
exemplified by peptide portions of mature myostatin or mature GDF-1 1 that have an 
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agonist or antagonist activity with respect to the activity of a mature GDF peptide. 
Preferably, the mature GDF peptide activity is an ability to interact specifically with 
its receptor. 

As disclosed herein, a mature myostatin peptide (referred to herein generally 
as "myostatin") can induce myostatin signal transduction activity by interacting 
specifically with a myostatin receptor expressed on the surface of a cell (see 
Example 7). Thus, a functional peptide portion of myostatin can be obtained by 
examining peptide portions of a mature myostatin peptide using a method as 
described herein (Example 7) or otherwise known in the art, and identifying 
functional peptide portions of myostatin that specifically interact with a myostatin 
receptor, for example, an activin type IIA receptor (Act RIIA) or Act RIIB receptor 
expressed on a cell. 

A myostatin prodomain can reduce or inhibit myostatin signal transduction 
activity. In one embodiment, the myostatin prodomain can interact specifically with 
myostatin, thereby reducing or inhibiting the ability of the myostatin peptide to 
interact specifically with its receptor. As disclosed herein, a precursor promyostatin 
also lacks the ability to interact specifically with a myostatin receptor, and, therefore, 
mutations in promyostatin that reduce or inhibit the ability of promyostatin to be 
cleaved into mature myostatin provide a means to reduce or inhibit myostatin signal 
transduction. Accordingly, in another embodiment, the present invention provides a 
mutant pro-GDF polypeptide, which contains one or more amino acid mutations that 
disrupt proteolytic cleavage of the mutant pro-GDF to an active mature GDF peptide. 

A mutant pro-GDF polypeptide of the invention can have a mutation that 
affects cleavage at a proteolytic cleavage site such as the consensus proteolytic 
cleavage recognition site Arg-Xaa-Xaa-Arg (SEQ ID NO: 21), which is present in 
pro-GDF polypeptides. Thus, the mutation can be a mutation of an Arg residue of 
SEQ ID NO: 21, such that a mutant promyostatin, for example, cannot be cleaved into 
a myostatin prodomain and a mature myostatin peptide. However, the mutation also 
can be at a site other than the proteolytic cleavage site, and can alter the ability of the 
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protease to bind to the pro-GDF polypeptide so as to effect proteolysis at the cleavage 
site. A mutant pro-GDF polypeptide of the invention, for example, a mutant 
promyostatin or mutant pro-GDF- 1 1 can have a dominant negative activity with 
respect to myostatin or GDF-1 1 and, therefore, can be useful for reducing or 
5 inhibiting myostatin or GDF-1 1 signal transduction in a cell. 

The present invention also provides a substantially purified polynucleotide, 
which encodes a peptide portion of a promyostatin polypeptide or a mutant 
promyostatin, or a peptide portion of a pro-GDF- 1 1 polypeptide or mutant 

10 pro-GDF-11, as described above. As discussed in greater detail below, the invention 
also provides polynucleotides useful as agents for modulating the affect of myostatin 
on a cell, and further provides a polynucleotide encoding a GDF receptor, or 
functional peptide portion thereof. Examples of such polynucleotides are provided in 
the following disclosure. As such, it should be recognized that the following 

15 disclosure is relevant to the various embodiments of the invention as disclosed herein. 

The term "polynucleotide" is used broadly herein to mean a sequence of two 
or more deoxyribonucleotides or ribonucleotides that are linked together by a 
phosphodiester bond. As such, the term "polynucleotide" includes RNA and DNA, 

20 which can be a gene or a portion thereof, a cDNA, a synthetic polydeoxyribonucleic 
acid sequence, or the like, and can be single stranded or double stranded, as well as a 
DNA/RNA hybrid. Furthermore, the term "polynucleotide" as used herein includes 
naturally occurring nucleic acid molecules, which can be isolated from a cell, as well 
as synthetic molecules, which can be prepared, for example, by methods of chemical 

25 synthesis or by enzymatic methods such as by the polymerase chain reaction (PCR). 
In various embodiments, a polynucleotide of the invention can contain nucleoside or 
nucleotide analogs, or a backbone bond other than a phosphodiester bond (see above). 

In general, the nucleotides comprising a polynucleotide are naturally occurring 
30 deoxyribonucleotides, such as adenine, cytosine, guanine or thymine linked to 

2'-deoxyribose, or ribonucleotides such as adenine, cytosine, guanine or uracil linked 
to ribose. However, a polynucleotide also can contain nucleotide analogs, including 
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non-naturally occurring synthetic nucleotides or modified naturally occurring 
nucleotides. Such nucleotide analogs are well known in the art and commercially 
available, as are polynucleotides containing such nucleotide analogs (Lin et al., Nucl. 
Acids Res. 22:5220-5234 (1994); Jellinek et al., Biochemistry 34:11363-11372 
5 (1995); Pagratis et al., Nature Biotechnol. 15:68-73 (1997), each of which is 
incorporated herein by reference). 

The covalent bond linking the nucleotides of a polynucleotide generally is a 
phosphodiester bond. However, the covalent bond also can be any of numerous other 

1 0 bonds, including a thiodiester bond, a phosphorothioate bond, a peptide-like bond or 
any other bond known to those in the art as useful for linking nucleotides to produce 
synthetic polynucleotides (see, for example, Tarn et al., Nucl. Acids Res. 22:977-986 
(1994); Ecker and Crooke, BioTechnology 13:351360 (1995), each of which is 
incorporated herein by reference). The incorporation of non-naturally occurring 

1 5 nucleotide analogs or bonds linking the nucleotides or analogs can be particularly 

useful where the polynucleotide is to be exposed to an environment that can contain a 
nucleolytic activity, including, for example, a tissue culture medium or upon 
administration to a living subject, since the modified polynucleotides can be less 
susceptible to degradation. 

20 

A polynucleotide comprising naturally occurring nucleotides and 
phosphodiester bonds can be chemically synthesized or can be produced using 
recombinant DNA methods, using an appropriate polynucleotide as a template. In 
comparison, a polynucleotide comprising nucleotide analogs or covalent bonds other 
25 than phosphodiester bonds generally will be chemically synthesized, although an 
enzyme such as T7 polymerase can incorporate certain types of nucleotide analogs 
into a polynucleotide and, therefore, can be used to produce such a polynucleotide 
recombinantly from an appropriate template (Jellinek et al, supra, 1995). 

30 Where a polynucleotide encodes a peptide, for example, a peptide portion of 

promyostatin or a peptide agent, the coding sequence generally is contained in a 
vector and is operatively linked to appropriate regulatory elements, including, if 
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desired, a tissue specific promoter or enhancer. The encoded peptide can be further 
operatively linked, for example, to peptide tag such as a His-6 tag or the like, which 
can facilitate identification of expression of the agent in the target cell. A 
polyhistidine tag peptide such as His-6 can be detected using a divalent cation such as 
5 nickel ion, cobalt ion, or the like. Additional peptide tags include, for example, a FLAG 
epitope, which can be detected using an anti-FLAG antibody (see, for example, 
Hopp et al., BioTechnology 6:1204 (1988); U.S. Patent No. 5,01 1,912, each of which 
is incorporated herein by reference); a c-myc epitope, which can be detected using an 
antibody specific for the epitope; biotin, which can be detected using streptavidin or 
10 avidin; and glutathione S -transferase, which can be detected using glutathione. Such 
tags can provide the additional advantage that they can facilitate isolation of the 
operatively linked peptide or peptide agent, for example, where it is desired to obtain 
a substantially purified peptide corresponding to a proteolytic fragment of a myostatin 
polypeptide. 

15 

As used herein, the term "operatively linked" or "operatively associated" 
means that two or more molecules are positioned with respect to each other such that 
they act as a single unit and effect a function attributable to one or both molecules or a 
combination thereof. For example, a polynucleotide sequence encoding a peptide of 

20 the invention can be operatively linked to a regulatory element, in which case the 

regulatory element confers its regulatory effect on the polynucleotide similarly to the 
way in which the regulatory element would effect a polynucleotide sequence with 
which it normally is associated with in a cell. A first polynucleotide coding sequence 
also can be operatively linked to a second (or more) coding sequence such that a 

25 chimeric polypeptide can be expressed from the operatively linked coding sequences. 
The chimeric polypeptide can be a fusion polypeptide, in which the two (or more) 
encoded peptides are translated into a single polypeptide, i.e., are covalently bound 
through a peptide bond; or can be translated as two discrete peptides that, upon 
translation, can operatively associate with each other to form a stable complex. 

30 

A chimeric polypeptide generally demonstrates some or all of the 
characteristics of each of its peptide components. As such, a chimeric polypeptide 
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can be particularly useful in performing methods of the invention, as disclosed herein. 
For example, in one embodiment, a method of the invention can modulate myostatin 
signal transduction in a cell. Thus, where one peptide component of a chimeric 
polypeptide encodes a cell compartment localization domain and a second peptide 
5 component encodes a dominant negative Smad polypeptide, the functional chimeric 
polypeptide can be translocated to the cell compartment designated by the cell 
compartment localization domain and can have the dominant negative activity of the 
Smad polypeptide, thereby modulating myostatin signal transduction in the cell. 

1 0 Cell compartmentalization domains are well known and include, for example, a 

plasma membrane localization domain, a nuclear localization signal, a mitochondrial 
membrane localization signal, an endoplasmic reticulum localization signal, or the like 
(see, for example, Hancock et al, EMBO J. 10:4033-4039, 1991; Buss et al., Mol. 
Cell. Biol. 8:3960-3963, 1988; U.S. Patent No. 5,776,689 each of which is 

1 5 incorporated herein by reference). Such a domain can be useful to target an agent to a 
particular compartment in the cell, or to target the agent for secretion from a cell. For 
example, the kinase domain of a myostatin receptor such as Act RIIB generally is 
associated with the inner surface of the plasma membrane. Thus, a chimeric 
polypeptide comprising a dominant negative myostatin receptor kinase domain, for 

20 example, a dominant negative Act RIIB receptor, which lacks kinase activity, can 
further comprise a plasma membrane localization domain, thereby localizing the 
dominant negative Act RIIB kinase domain to the inner ceil membrane. 

As disclosed herein, a pro-GDF signal peptide has cellular localization activity. 

25 As used herein, the term "cellular localization activity" refers to the ability of a signal 
peptide to direct translocation of a peptide operably linked thereto to one or more 
specific intracellular compartments or to direct secretion of the molecule from the cell. 
As such, a pro-GDF signal peptide can be particularly useful for directing translocation 
of a peptide or other agent operably linked to the signal peptide to the same intracellular 

30 compartments as a naturally expressed GDF having substantially the same signal 

peptide: Furthermore, the signal peptide, for example, a promyostatin signal peptide 
comprising about the first 1 5 to 30 amino acids of promyostatin, can direct secretion of 
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an operably linked agent from the cell through the same pathway as the naturally 
occurring pro-GDF having comprising the signal peptide. Thus, particularly useful 
agents for performing a method of the invention include a GDF prodomain or functional 
peptide portion thereof that is operably linked to a GDF signal peptide, preferably a 
5 promyostatin or pro-GDF-1 1 signal peptide. 

A polynucleotide of the invention, including a polynucleotide agent useful in 
performing a method of the invention, can be contacted directly with a target cell. For 
example, oligonucleotides useful as antisense molecules, ribozymes, or triplexing 

10 agents can be directly contacted with a target cell, whereupon the enter the cell and 
effect their function. A polynucleotide agent also can interact specifically with a 
polypeptide, for example, a myostatin receptor (or myostatin), thereby altering the 
ability of myostatin to interact specifically with the receptor. Such polynucleotides, 
as well as methods of making and identifying such polynucleotides, are disclosed 

15 herein or otherwise well known in the art (see, for example, O'Connell et al., Proc. 
Natl. Acad. Sci.. USA 93:5883-5887, 1996; Tuerk and Gold, Science 249:505-510, 
1990; Gold et al., Ann. Rev. Biochem. 64:763-797, 1995; each of which is 
incorporated herein by reference). 

20 A polynucleotide of the invention, which can encode a peptide portion of a 

pro-GDF polypeptide such as promyostatin, or can encode a mutant promyostatin . 
polypeptide, or can encode a GDF receptor or functional peptide portion thereof, or 
can be a polynucleotide agent useful in performing a method of the invention, can be 
contained in a vector, which can facilitate manipulation of the polynucleotide, 

25 including introduction of the polynucleotide into a target cell. The vector can be a 
cloning vector, which is useful for maintaining the polynucleotide, or can be an 
expression vector, which contains, in addition to the polynucleotide, regulatory 
elements useful for expressing the polynucleotide and, where the polynucleotide 
encodes a peptide, for expressing the encoded peptide in a particular cell. An 

30 expression vector can contain the expression elements necessary to achieve, for 
example, sustained transcription of the encoding polynucleotide, or the regulatory 
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elements can be operatively linked to the polynucleotide prior to its being cloned into 
the vector. 

An expression vector (or the polynucleotide) generally contains or encodes a 
5 promoter sequence, which can provide constitutive or, if desired, inducible or tissue 
specific or developmental stage specific expression of the encoding polynucleotide, a 
poly -A recognition sequence, and a ribosome recognition site or internal ribosome 
entry site, or other regulatory elements such as an enhancer, which can be tissue 
specific. The vector also can contain elements required for replication in a 

1 0 prokaryotic or eukaryotic host system or both, as desired. Such vectors, which 
include plasmid vectors and viral vectors such as bacteriophage, baculovirus, 
retrovirus, lentivirus, adenovirus, vaccinia virus, semliki forest virus and 
adeno-associated virus vectors, are well known and can be purchased from a 
commercial source (Promega, Madison WI; Stratagene, La Jolla CA; GIBCO/BRL, 

1 5 Gaithersburg MD) or can be constructed by one skilled in the art (see, for example, 
Metfa. Enzymol. . Vol. 185, Goeddel, ed. (Academic Press, Inc., 1990); Jolly, Cane. 
Gene Ther. 1:51-64, 1994; Flotte. J. Bioenerg. Biomemb. 25:37-42. 1993; 
Kirshenbaum et al, J. Clin. Invest. 92:381-387, 1993; each of which is incorporated 
herein by reference). 

20 

A tetracycline (tet) inducible promoter can be particularly useful for driving 
expression of a polynucleotide of the invention, for example, a polynucleotide 
encoding a dominant negative form of myostatin, in which the proteolytic processing 
site has been mutated, or encoding a myostatin prodomain, which can form a complex 

25 with a mature myostatin peptide, or encoding a dominant negative form of a GDF 
receptor. Upon administration of tetracycline, or a tetracycline analog, to a subject 
containing a polynucleotide operatively linked to a tet inducible promoter, expression 
of the encoded peptide is induced, whereby the peptide can effect its activity, for 
example, whereby a peptide agent can reduce or inhibit myostatin signal transduction. 

30 Such a method can be used, for example, to induce muscle hypertrophy in an adult 
organism. 
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The polynucleotide also can be operatively linked to tissue specific regulatory 
element, for example, a muscle cell specific regulatory element, such that expression 
of an encoded peptide is restricted to the muscle cells in an individual, or to muscle 
cells in a mixed population of cells in culture, for example, an organ culture. Muscle 
5 cell specific regulatory elements including, for example, the muscle creatine kinase 
promoter (Sternberg et al., Mol. Cell. Rinl. 8:2896-2909, 1988, which is incorporated 
herein by reference) and the myosin light chain enhancer/promoter (Donoghue et al, 
Proc. Natl. Acad. Set.. USA 88:5847-5851, 1991, which is incorporated herein by 
reference) are well known in the art. 

10 

Viral expression vectors can be particularly useful for introducing a 
polynucleotide into a cell, particularly a cell in a subject. Viral vectors provide the 
advantage that they can infect host cells with relatively high efficiency and can infect 
specific cell types. For example, a polynucleotide encoding a myostatin prodomain or 

1 5 functional peptide portion thereof can be cloned into a baculovirus vector, which then 
can be used to infect an insect host cell, thereby providing a means to produce large 
amounts of the encoded prodomain. The viral vector also can be derived from a virus 
that infects cells of an organism of interest, for example, vertebrate host cells such as 
mammalian, avian or piscine host cells. Viral vectors can be particularly useful for 

20 introducing a polynucleotide useful in performing a method of the invention into a 
target cell. Viral vectors have been developed for use in particular host systems, 
particularly mammalian systems and include, for example, retroviral vectors, other 
lentivirus vectors such as those based on the human immunodeficiency virus (HIV), 
adenovirus vectors, adeno-associated virus vectors, herpesvirus vectors, vaccinia virus 

25 vectors, and the like (see Miller and Rosman, BioTechniques 7:980-990, 1 992; 

Anderson et al., Nature 392:25-30 Suppl, 1998; Verma and Somia, Nature 389:239- 
242, 1997; Wilson, New Engl. J. Med. 334:1 185-1 187 (1996), each of which is 
incorporated herein by reference). 

30 When retroviruses, for example, are used for gene transfer, replication 

competent retroviruses theoretically can develop due to recombination of retroviral 
vector and viral gene sequences in the packaging cell line utilized to produce the 
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retroviral vector. Packaging cell lines in which the production of replication 
competent virus by recombination has been reduced or eliminated can be used to 
minimize the likelihood that a replication competent retrovirus will be produced. All 
retroviral vector supernatants used to infect cells are screened for replication 
5 competent virus by standard assays such as PCR and reverse transcriptase assays. 
Retroviral vectors allow for integration of a heterologous gene into a host cell 
genome, which allows for the gene to be passed to daughter cells following cell 
division. 

10 A polynucleotide, which can be contained in a vector, can be introduced into a 

cell by any of a variety of methods known in the art (Sambrook et al., Molecular 
Cloning: A laboratory manual (Cold Spring Harbor Laboratory Press 1989); Ausubel 
et al., Current Protocols in M olecular Biology . John Wiley and Sons, Baltimore, MD 
(1987, and supplements through 1995), each of which is incorporated herein by 

1 5 reference). Such methods include, for example, transfection, lipofection, 

microinjection, electroporation and, with viral vectors, infection; and can include the 
use of liposomes, microemulsions or the like, which can facilitate introduction of the 
polynucleotide into the cell and can protect the polynucleotide from degradation prior 
to its introduction into the cell. The selection of a particular method will depend, for 

20 example, on the cell into which the polynucleotide is to be introduced, as well as 
whether the cell is isolated in culture, or is in a tissue or organ in culture or in situ. 

Introduction of a polynucleotide into a cell by infection with a viral vector is 
particularly advantageous in that it can efficiently introduce the nucleic acid molecule 

25 into a cell ex vivo or in vivo (see, for example, U.S. Patent No. 5,399,346, which is 

incorporated herein by reference). Moreover, viruses are very specialized and can be 
selected as vectors based on an ability to infect and propagate in one or a few specific 
cell types. Thus, their natural specificity can be used to target the nucleic acid 
molecule contained in the vector to specific cell types. As such, a vector based on an 

30 HIV can be used to infect T cells, a vector based on an adenovirus can be used, for 
example, to infect respiratory epithelial cells, a vector based on a herpesvirus can be 
used to infect neuronal cells, and the like. Other vectors, such as adeno-associated 
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viruses can have greater host cell range and, therefore, can be used to infect various 
cell types, although viral or non-viral vectors also can be modified with specific 
receptors or ligands to alter target specificity through receptor mediated events. 

The present invention also provides antibodies that specifically bind a peptide 
portion of a promyostatin polypeptide or a mutant promyostatin polypeptide. 
Particularly useful antibodies of the invention include antibodies that specifically bind 
a myostatin prodomain, or a functional peptide portion thereof, and antibodies that 
bind a promyostatin polypeptide and reduce or inhibit proteolytic cleavage of the 
promyostatin to a mature myostatin peptide. In addition, an antibody of the invention 
can be an antibody that specifically binds a GDF receptor, or functional peptide 
portion thereof, as described below. Methods of preparing and isolating an antibody 
of the invention are described in greater detail below, the disclosure of which is 
incorporated herein by reference. 

Myostatin is essential for proper regulation of skeletal muscle mass. As 
compared to wild type mice, myostatin knock-out mice, which lack myostatin, have 
two to three times the amount of muscle due to a combination of hyperplasia and 
hypertrophy. As disclosed herein, myostatin knock-out mice also have a dramatic 
reduction in fat accumulation due, at least in part, to an increased anabolic state of 
skeletal muscle tissue throughout the body. Conversely, overexpression of myostatin 
in nude mice induced a wasting syndrome that resembles the cachectic state observed 
in human patients suffering from chronic diseases such as cancer or AIDS. As further 
disclosed herein, myostatin activity can be mediated through a signal transduction 
having the characteristics of the Smad signal transduction pathway. Accordingly, the 
invention provides methods of modulating an effect of myostatin on a cell by 
contacting the cell with an agent that affects myostatin signal transduction in the cell. 

As used herein, the term "modulate," when used in reference to an effect of 
myostatin on a cell, means that myostatin signal transduction in the cell either is 
increased or is reduced or inhibited. The terms "increase" and "reduce or inhibit" are 
used in reference to a baseline level of myostatin signal transduction activity, which 
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can be the level of activity of the signal transduction pathway in the absence of 
myostatin, or the level of activity in a normal cell in the presence of myostatin. For 
example, the myostatin signal transduction pathway exhibits a particular activity in a 
muscle cell contacted with myostatin, and, upon further contacting the muscle cell 
5 with a myostatin prodomain, myostatin signal transduction activity can be reduced or 
inhibited. As such, a myostatin prodomain is an agent useful for reducing or 
inhibiting myostatin signal transduction. Similarly, a prodomain of another GDF 
family member such as a GDF-1 1 prodomain, or of another TGF-p family member 
such as an activin prodomain, MIS prodomain, or the like, can be useful for reducing 

10 myostatin signal transduction. The terms "reduce or inhibit" are used together herein 
because it is recognized that, in some cases, the level of myostatin signal transduction 
can be reduced below a level that can be detected by a particular assay. As such, it 
may not be determinable using such an assay as to whether a low level of myostatin 
signal transduction remains, or whether the signal transduction is completely 

1 5 inhibited. 

As used herein, the term "myostatin signal transduction" refers to the series of 
events, generally a series of protein-protein interactions, that occurs in a cell due to 
the specific interaction of myostatin with a myostatin receptor expressed on the 

20 surface of the cell. As such, myostatin signal transduction can be detected, for 

example, by detecting a specific interaction of myostatin with its receptor on a cell, by 
detecting phosphorylation of one or more polypeptides involved in a myostatin signal 
transduction pathway in the cell, by detecting expression of one or more genes that 
are specifically induced due to myostatin signal transduction, or by detecting a 

25 phenotypic change that occurs in response to myostatin signal transduction (see 

Examples). As disclosed herein, an agent useful in a method of the invention can act 
as an agonist to stimulate myostatin signal transduction or as an antagonist to reduce 
or inhibit myostatin signal transduction. 

30 The methods of the present invention are exemplified generally herein with 

respect to myostatin. It should be recognized, however, that the methods of the 
invention can more broadly encompass modulating an effect of other GDF peptides, 
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for example, GDF-1 1, on a cell by contacting the cell with an agent that affects signal 
transduction due to the GDF in the cell. Methods of practicing the full scope of the 
invention will be readily known in view of the present disclosure, which includes, for 
example, methods for identifying GDF receptors, methods for identifying agents that 
5 modulate signal transduction due to a specific interaction of the GDF with its 
receptor, and the like. 

A myostatin signal transduction pathway is exemplified herein by the Smad 
pathway, which is initiated upon myostatin specifically interacting with the 

1 0 extracellular domain of an activin type II receptor and propagated through interactions 
of intracellular polypeptides, including Smad proteins, in the cell. In general, 
myostatin signal transduction is associated with phosphorylation or dephosphorylation 
of specific intracellular polypeptides such as Smad polypeptides. Thus, myostatin 
signal transduction in a cell can be detected by detecting an increased level of 

1 5 phosphorylation of one or more Smad polypeptides in the presence of myostatin as 
compared to the level of phosphorylation of the polypeptides in the absence of 
myostatin. A method of the invention provides a means to increase or decrease 
myostatin signal transduction and, therefore, the level of phosphorylation of an Smad 
polypeptide involved in a myostatin signal transduction pathway will be increased 

20 above a normal level or decreased below a level expected in the presence of 
myostatin, respectively. 

A method of the invention can be performed, for example, by contacting under 
suitable conditions a target cell and an agent that affects myostatin signal transduction 

25 in the cell. Suitable conditions can be provided by placing the cell, which can be an 
isolated cell or can be a component of a tissue or organ, in an appropriate culture 
medium, or by contacting the cell in situ in an organism. For example, a medium 
containing the cell can be contacted with an agent the affects the ability of myostatin 
to specifically interact with a myostatin receptor expressed on the cell, or with an 

30 agent that affects a myostatin signal transduction pathway in the cell. In general, the 
cell is a component of a tissue or organ in a subject, in which case contacting the cell 
can comprise administering the agent to the subject. However, the cell also can be 
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manipulated in culture, then can be maintained in culture, administered to a subject, or 
used to produce a transgenic nonhuman animal. 

An agent useful in a method of the invention can be any type of molecule, for 
5 example, a polynucleotide, a peptide, a peptidomimetic, peptoids such as vinylogous 
peptoids, a small organic molecule, or the like, and can act in any of various ways to 
affect myostatin signal transduction. The agent can act extracellularly by binding 
myostatin or a myostatin receptor such as an activin receptor, thereby altering the 
ability of myostatin to specifically interact with its receptor, or can act intracellularly 

10 to alter myostatin signal transduction in the cell. In addition, the agent can be an 

agonist, which mimics or enhances the effect of myostatin on a cell, for example, the 
ability of myostatin to specifically interact with its receptor, thereby increasing 
myostatin signal transduction in the cell; or can be an antagonist, which can reduces 
or inhibits the effect of myostatin on a cell, thereby reducing or inhibiting myostatin 

1 5 signal transduction in the cell. 

As used herein, the term "specific interaction" or "specifically binds" or the 
like means that two molecules form a complex that is relatively stable under 
physiologic conditions. The term is used herein in reference to various interactions, 

20 including, for example, the interaction of myostatin and a myostatin receptor, the 
interaction of the intracellular components of a myostatin signal transduction 
pathway, the interaction of an antibody and its antigen, and the interaction of a 
myostatin prodomain with myostatin. A specific interaction can be characterized by a 
dissociation constant of at least about 1 x 10" 6 M, generally at least about 1 x 10" 7 M, 

25 usually at least about 1 x 10" 8 M, and particularly at least about 1 x 10" 9 M or 

1 x 10" 10 M or greater. A specific interaction generally is stable under physiological 
conditions, including, for example, conditions that occur in a living individual such as 
a human or other vertebrate or invertebrate, as well as conditions that occur in a cell 
culture such as used for maintaining mammalian cells or cells from another vertebrate 

30 organism or an invertebrate organism. In addition, a specific interaction such as the 
extracellular interaction of a myostatin prodomain and myostatin generally is stable 
under conditions such as those used for aquaculture of a commercially valuable 
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marine organism. Methods for determining whether two molecules interact 
specifically are well known and include, for example, equilibrium dialysis, surface 
plasmon resonance, and the like. 

An agent that alters a specific interaction of myostatin with its receptor can 
act, for example, by binding to myostatin such that it cannot interact specifically with 
its cellular receptor, by competing with myostatin for binding to its receptor, or by 
otherwise by-passing the requirement that myostatin specifically interact with its 
receptor in order to induce myostatin signal transduction. A truncated myostatin 
receptor such as a soluble extracellular domain of a myostatin receptor is an example 
of an agent that can bind myostatin, thereby sequestering myostatin and reducing or 
inhibiting its ability to interact specifically with a cell surface myostatin receptor. A 
myostatin prodomain or functional peptide portion thereof is another example of an 
agent that can bind myostatin, thereby reducing or inhibiting the ability of the 
myostatin to interact specifically with a cell surface myostatin receptor. Such 
myostatin antagonists are useful in practicing a method of the invention, particularly 
for reducing or inhibiting myostatin signal transduction in a cell. 

Follistatin is another example of an agent that can bind to myostatin, thereby 
reducing or inhibiting the ability of myostatin to interact specifically with its receptor. 
Follistatin can bind to and inhibit the activity of various TGF-B family members, 
including myostatin (GDF-8; U.S. Patent No. 6,004,937) and GDF-1 1 (Gamer et al, 
Devel. Biol. 208:222-232, 1999) and, therefore, can be used for performing a method 
as disclosed. Although the use of follistatin for modulating the effects of myostatin 
previously has been described (U.S. Patent No. 6,004,937), it was not known, prior to 
the present disclosure, that follistatin reduces or inhibits the ability of myostatin to 
interact specifically with a myostatin receptor such as Act RIIB. 

An agent useful in a method of the invention also can interact with a cellular 
myostatin receptor, thereby competing with myostatin for the receptor. Such an agent 
can be, for example, an antibody that specifically binds a cell surface myostatin 
receptor, including all or a portion of the myostatin binding domain, thereby 
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preventing myostatin from interacting specifically with the receptor. Such an 
anti-myostatin receptor antibody can be selected for its ability to specifically bind the 
receptor without activating myostatin signal transduction and, therefore, can be useful 
as a myostatin antagonist for reducing or inhibiting myostatin signal transduction; or 
5 can be selected for its ability to specifically bind the receptor and activate myostatin 
signal transduction, thus acting as a myostatin agonist. The antibody can be raised 
using a myostatin receptor, or the extracellular domain of the receptor, as an 
immunogen, or can be an anti-idiotype antibody, which is raised against an 
anti-myostatin antibody and mimics myostatin. Anti-GDF receptor antibodies are 
1 0 discussed in greater detail below. 

An agent useful in a method of the invention also can be an agent that reduces 
or inhibits proteolytic cleavage of a pro-GDF polypeptide to an active mature GDF 
peptide, thereby reducing or inhibiting GDF signal transduction. Such an agent can 

15 be a protease inhibitor, particularly one that inhibits the activity of a protease that 
recognizes and cleaves an Arg-Xaa-Xaa-Arg (SEQ ID NO: 21) proteolytic 
recognition site. Where the pro-GDF is promyostatin, an anti-myostatin antibody that 
reduces or inhibits the specific binding of a protease to the Arg-Xaa-Xaa-Arg (SEQ 
ID NO: 21) proteolytic cleavage site in myostatin also can be used reduce or inhibit 

20 proteolysis of promyostatin, thereby reducing the amount of mature myostatin 

produced. Such an antibody can bind the proteolytic cleavage site, or can bind some 
other site on the pro-GDF polypeptide such that binding of and cleavage by the 
protease is reduced or inhibited. 

25 In addition, an agent useful in a method of the invention can be a mutant 

myostatin receptor, which, for example, lacks myostatin signal transduction activity in 
response to myostatin binding, or has constitutive myostatin signal transduction 
activity. For example, a mutant myostatin receptor can have a point mutation, a 
deletion, or the like in its kinase domain such that the receptor lacks kinase activity. 

30 Such a dominant negative mutant myostatin receptor lacks the ability to transmit 

myostatin signal transduction despite the fact that it can specifically bind myostatin. 
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An agent useful in a method of the invention also can modulate the level or 
activity of an intracellular polypeptide involved in a myostatin signal transduction 
pathway. As disclosed herein, regulation of muscle growth by myostatin can involve 
components of a signal transduction pathway that is activated by activin type II 
5 receptors (see Examples 7 and 9; see, also, Example 14). Myostatin specifically 

interacts with activin type IIB receptors (Act RIIB) expressed in COS cells in culture 
(Example 7). The low affinity of binding indicates that myostatin binding to 
Act RIIB in vivo may involve other factors, similar to TGF-B, which has significantly 
higher affinity for the type II receptor when the type I receptor also is present 
10 (Attisano et al, Ceil 75:671-680, 1993), or to other systems that require other 

molecules for presenting the ligand to the signaling receptor (Massague, supra, 1998; 
Wang et al., Cejl 67:795-805, 1991). 

The specific interaction of myostatin with Act RIIB indicates that myostatin 
15 signal transduction can involve components of the Smad signal transduction pathway. 
Thus, the Smad signal transduction pathway provides a target for modulating the 
effect of myostatin on a cell, and agents that affect the Smad pathway can be useful 
for modulating myostatin signal transduction in a cell. 

20 Agents useful for modulating the level or activity of intracellular polypeptide 

components of a GDF signal transduction include agonists, which can increase signal 
transduction activity, and antagonists, which can reduce or inhibit signal transduction 
activity. With respect to myostatin, for example, agents that can increase myostatin 
signal transduction activity are exemplified by phosphatase inhibitors, which can 

25 reduce or inhibit dephosphorylation of Smad polypeptides, thereby prolonging the 
signal transducing activity of the Smad. Dominant negative Smad 6 or Smad 7 
polypeptides, which can negate the inhibitory effect of Smad 6 and Smad 7 on 
myostatin signal transduction, are additional examples of agents that can increase 
myostatin signal transduction activity by increasing Smad signal transduction. 

30 

Antagonist agents that can reduce or inhibit myostatin signal transduction 
activity are exemplified by dominant negative Smad polypeptides such as dominant 
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negative Smad 2, Smad 3 or Smad 4, in which the C-terminal phosphorylation sites 
have been mutated. The inhibitory Smad polypeptides such as Smad 6 and Smad 7, 
which inhibit the activation of Smad 2 and Smad 3; and a c-ski polypeptide, which 
binds Smad polypeptides and inhibits signal transduction, are additional examples of 
5 antagonists useful for reducing or inhibiting myostatin signal transduction by 
decreasing Smad signal transduction. 

Where the agent that acts intracellularly is a peptide, it can be contacted with 
the cell directly, or a polynucleotide encoding the peptide (or polypeptide) can be 

10 introduced into the cell and the peptide can be expressed in the cell. It is recognized 
that some of the peptides useful in a method of the invention are relatively large and, 
therefore, may not readily traverse a cell membrane. However, various methods are 
known for introducing a peptide into a cell. The selection of a method for introducing 
such a peptide into a cell will depend, in part, on the characteristics of the target cell, into 

1 5 which the polypeptide is to be provided. For example, where the target cells, or a few 
cell types including the target cells, express a receptor, which, upon binding a particular 
ligand, is internalized into the cell, the peptide agent can be operatively associated with 
the ligand. Upon binding to the receptor, the peptide is translocated into the cell by 
receptor-mediated endocytosis. The peptide agent also can be encapsulated in a 

20 liposome or formulated in a lipid complex, which can facilitate entry of the peptide into 
the cell, and can be further modified to express a receptor (or ligand), as above. The 
peptide agent also can be introduced into a cell by engineering the peptide to contain a 
protein transduction domain such as the human immunodeficiency virus TAT protein 
transduction domain, which facilitates translocation of the peptide into the cell (see 

25 Schwarze et al., Science 285:1569-1572 (1999), which is incorporated herein by 
reference; see, also, Derossi et al., J. Biol. Chem. 271:18188 (1996)). 

The target cell also can be contacted with a polynucleotide encoding the 
peptide agent, which can be expressed in the cell. The expressed peptide agent can be 
30 a mutant GDF receptor or peptide portion thereof. Example of a mutant GDF 

receptor include a kinase-deficient form of a myostatin receptor such as a dominant 
negative Act RIIA or Act RIIB, which can, but need not, have the ability to 
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specifically bind a ligand (e.g., myostatin); and a truncated myostatin or other GDF 
receptor, such as a soluble form of a myostatin receptor, which binds myostatin, 
thereby sequestering it from interacting specifically with a cellular myostatin receptor; 
a dominant-negative form of a Smad polypeptide such as a dominant negative 
Smad 3, in which the C-terminal phosphorylation sites have been mutated (Liu et al., 
Proc. Natl. Acad. Sci., USA 94:10669-10674, 1997); a Smad 7 polypeptide, which 
inhibits the activation of Smad 2 and Smad 3 (Heldin et al., Nature 390:465-471, 
1997); or a c-ski polypeptide, which can bind a Smad polypeptide and inhibit signal 
transduction by the Smad (Sutrave et al., Genes Devel. 4:1462-1472, 1990). 

Expression of a c-ski peptide agent in a cell can be particularly useful for 
modulating myostatin signal transduction. Mice lacking c-ski show a severe 
reduction in skeletal muscle mass (Berk et al., Genes Devel. 1 1:2029-2039, 1997), 
whereas transgenic mice that overexpress c-ski in muscle show dramatic muscle 
hypertrophy (Sutrave et al., supra, 1 990). c-ski interacts with and blocks the activity 
of certain Smad proteins, including Smad 2, Smad 3 and Smad 4, which mediate 
signaling of TGF-B and activin type II receptors (Luo et al., Genes Devel. 
13:2196-1106, 1999; Stroschein et al., Science 286:771-774, 1999; Sunetal.,MoL 
Cell 4:499-509, 1999a; Sun et al., Proc. Natl. Acad. Sci.. USA 96:1 12442-12447, 
1999b; Akiyoshi et al., J. Biol. Chem. 274:35269, 1999). Thus, in view of the present 
disclosure that myostatin activity can be mediated through Act RIIB binding, it will 
be recognized that the activity of myostatin, or of any GDF that utilizes an Smad 
pathway, can be modulated by increasing or decreasing c-ski expression in a target 
cell. - 

An agent useful in a method of the invention can be a polynucleotide, which 
can be contacted with or introduced into a cell as described above. Generally, but not 
necessarily, the polynucleotide is introduced into the cell, where it effects its function 
either directly, or following transcription or translation or both. For example, as 
discussed above, the polynucleotide can encode a peptide agent, which is expressed in 
the cell and modulates myostatin activity. Such an expressed peptide can be, for 
example, a mutant promyostatin polypeptide, which cannot be cleaved into active 

GrayCary\GT\6209497.1 
104659-160348 



51 



myostatin; or can be a mutant my o statin receptor, for example, a truncated myostatin 
receptor extracellular domain; a myostatin receptor extracellular domain operatively 
associated with a membrane anchoring domain; or a mutant myostatin receptor 
lacking protein kinase activity. Methods for introducing a polynucleotide into a cell 
5 are exemplified below or otherwise known in the art. 

A polynucleotide agent useful in a method of the invention also can be, or can 
encode, an antisense molecule, a ribozyme or a triplexing agent. For example, the 
polynucleotide can be (or can encode) an antisense nucleotide sequence such as an 

10 antisense c-ski nucleotide sequence, which can act as an agonist to increase myostatin 
signal transduction in a cell; or an antisense Smad nucleotide sequence, which can act 
either as an agonist to increase myostatin signal transduction or as antagonist to 
reduce or inhibit myostatin signal transduction, depending on the particular Smad 
antisense nucleotide sequence. Such polynucleotides can be contacted directly with a 

15 target cell and, upon uptake by the cell, can effect their antisense, ribozyme or 

triplexing activity; or can be encoded by a polynucleotide that is introduced into a 
cell, whereupon the polynucleotide is expressed to produce, for example, an antisense 
RNA molecule or ribozyme, which effects its activity. 

20 An antisense polynucleotide, ribozyme or triplexing agent is complementary 

to a target sequence, which can be a DNA or RNA sequence, for example, messenger 
RNA, and can be a coding sequence, a nucleotide sequence comprising an intron-exon 
junction, a regulatory sequence such as a Shine-Delgarno sequence, or the like. The 
degree of complementarity is such that the polynucleotide, for example, an antisense 

25 polynucleotide, can interact specifically with the target sequence in a cell. Depending 
on the total length of the antisense or other polynucleotide, one or a few mismatches 
with respect to the target sequence can be tolerated without losing the specificity of 
the polynucleotide for its target sequence. Thus, few if any mismatches would be 
tolerated in an antisense molecule consisting, for example, of 20 nucleotides, whereas 

30 several mismatches will not affect the hybridization efficiency of an antisense 

molecule that is complementary, for example, to the full length of a target mRNA 
encoding a cellular polypeptide. The number of mismatches that can be tolerated can 
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be estimated, for example, using well known formulas for determining hybridization 
kinetics (see Sambrook et al., supra, 1989) or can be determined empirically using 
methods as disclosed herein or otherwise known in the art, particularly by 
determining that the presence of the antisense polynucleotide, ribozyme, or triplexing 
5 agent in a cell decreases the level of the target sequence or the expression of a 
polypeptide encoded by the target sequence in the cell. 

A polynucleotide useful as an antisense molecule, a ribozyme or a triplexing 
agent can inhibit translation or cleave the nucleic acid molecule, thereby modulating 

1 0 myostatin signal transduction in a cell. An antisense molecule, for example, can bind 
to an mRNA to form a double stranded molecule that cannot be translated in a cell. 
Antisense oligonucleotides of at least about 1 5 to 25 nucleotides are preferred since 
they are easily synthesized and can hybridize specifically with a target sequence, 
although longer antisense molecules can be expressed from a polynucleotide 

1 5 introduced into the target cell. Specific nucleotide sequences useful as antisense 
molecules can be identified using well known methods, for example, gene walking 
methods (see, for example, Seimiya et al, J. Biol. Chem. 272:4631-4636 (1997), 
which is incorporated herein by reference). Where the antisense molecule is 
contacted directly with a target cell, it can be operatively associated with a chemically 

20 reactive group such as iron-linked EDTA, which cleaves a target RNA at the site of 
hybridization. A triplexing agent, in comparison, can stall transcription (Maher et al., 
Antisense Res. Devel. 1:227 (1991); Helene, Anticancer Drug Design 6:569 (1991)). 
Thus, a triplexing agent can be designed to recognize, for example, a sequence of a 
Smad gene regulatory element, thereby reducing or inhibiting the expression of a 

25 Smad polypeptide in the cell, thereby modulating myostatin signal transduction in a 
target cell. 

The present invention also provides a method of identifying an agent that can 
alter the effect of a GDF such myostatin on a cell, particularly agents that can alter the 
30 ability of the GDF to interact specifically with its cellular receptor. Such agents can 
act by increasing or decreasing the ability of the GDF to interact specifically with its 
receptor and, therefore, are useful for increasing or decreasing GDF signal 
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transduction, respectively. A screening method of the invention is exemplified herein 
using a myostatin receptor, for example, an activin type II receptor such as Act RIIA 
or Act RIIB. 

5 A screening method of the invention can be performed, for example, by 

contacting under suitable conditions myostatin, or a functional peptide portion 
thereof, a myostatin receptor such as Act RIIA or Act RIIB, and an agent to be tested. 
The myostatin, the receptor and the agent can be contacted in any order as desired. 
As such, the screening method can be used to identify agents that can competitively or 

1 0 non-competitively inhibit myostatin binding to the receptor, agents that can mediate 
or enhance myostatin binding to the receptor, agents that can induce dissociation of 
specifically bound myostatin from the receptor, and agents that otherwise affect the 
ability of myostatin to induce signal transduction, such agents having agonist or 
antagonist activity. Appropriate control reactions are performed to confirm that the 

1 5 action of the agent is specific with respect to the myostatin or other GDF receptor. 

Suitable conditions for performing a screening method of the invention can be 
any conditions that allow myostatin to specifically interact with its receptor, including 
methods as disclosed herein (see Examples 7 and 9) or otherwise known in the art. 

20 Thus, suitable conditions for performing the screening assay can be, for example, 
in vitro conditions using a substantially purified myostatin receptor; cell culture 
conditions, utilizing a cell that normally expresses a myostatin receptor, for example, 
an adipocyte or a muscle cell, or a cell that has been genetically modified to express a 
functional myostatin receptor on its surface; or in situ conditions as occur in an 

25 organism. 

A screening method of the invention also can be performed using the methods 
of molecular modeling as described above. The utilization of a molecular modeling 
method provides a convenient, cost effective means to identify those agents, among a 
30 large population such as a combinatorial library of potential agents, that are most 
likely to interact specifically with a GDF receptor, thereby reducing the number of 
potential agents that need to be screened using a biological assay. Upon identifying 

Gray Cary\GT\6209497. 1 
104659-160348 



54 



agents that interact specifically with a GDF receptor such as Act RIIB using a 
molecular modeling method, the selected agents can be examined for the ability to 
modulate an effect of a GDF such as myostatin on a cell using the methods disclosed 
herein. 

5 

The ability of a test agent to modulate an effect of myostatin can be detected 
using methods as disclosed herein (see Examples 7 and 9) or otherwise known in the 
art. The term "test agent" or "test molecule" is used broadly herein to mean any agent 
that is being examined for agonist or antagonist activity in a method of the invention. 
10 Although the method generally is used as a screening assay to identify previously 
unknown molecules that can act as agonist or antagonist agents as described herein, 
the methods also can be used to confirm that a agent known to have a particular 
activity in fact has the activity, for example, in standardizing the activity of the agent. 

15 A method of the invention can be performed, for example, by contacting 

myostatin with a cell that has been genetically modified to express an Act RIIB 
receptor, and determining the effect of an agent, for example, a dominant negative 
Act RIIB, by examining phosphorylation of a Smad polypeptide involved in the 
myostatin signal transduction pathway. If desired, the cell can be further genetically 

20 modified to contain a reporter nucleotide sequence, the expression of which is 

dependent on the myostatin signal transduction pathway, for example, on activation of 
the Smad pathway, and the effect of the test agent can be determined by comparing 
expression of the reporter nucleotide sequence in the presence and absence of the 
agent, the myostatin, or both. Expression of the reporter nucleotide sequence can be 

25 detected, for example, by detecting an RNA transcript of the reporter nucleotide 

sequence, or by detecting a polypeptide encoded by the reporter nucleotide sequence. 
A polypeptide reporter can be, for example, a P-lactamase, chloramphenicol 
acetyltransferase, adenosine deaminase, aminoglycoside phosphotransferase, 
dihydrofolate reductase, hygromycin-B phosphotransferase, thymidine kinase, 

30 |3-galactosidase, luciferase or xanthine guanine phosphoribosyltransferase polypeptide 
or the like, and can be detected, for example, by detecting radioactivity, 
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luminescence, chemiluminescence, fluorescence, enzymatic activity, or specific 
binding due to the reporter polypeptide. 

A screening method of the invention provides the advantage that it can be 
5 adapted to high throughput analysis and, therefore, can be used to screen 
combinatorial libraries of test agents in order to identify those agents that can 
modulate an effect of myostatin on a cell, including those agents that can alter a 
specific interaction of myostatin and a myostatin receptor. Methods for preparing a 
combinatorial library of molecules that can be tested for a desired activity are well 

1 0 known in the art and include, for example, methods of making a phage display library of 
peptides, which can be constrained peptides (see, for example, U.S. Patent 
No. 5,622,699; U.S. Patent No. 5,206,347; Scott and Smith, Sdence_249:386-390, 1992; 
Markland et al., Gene 109:13-19, 1991 ; each of which is incorporated herein by 
reference); a peptide library (U.S. Patent No. 5,264,563, which is incorporated herein 

15 by reference); a peptidomimetic library (Blondelle et al., Trends Anal. Chem. 

14:83-92, 1995; a nucleic acid library (O'Connell et al., supra, 1996; Tuerk and Gold, 
supra, 1990; Gold et al., supra, 1995; each of which is incorporated herein by 
reference); an oligosaccharide library (York et al., Carb. Res. . 285:99-128, 1996; 
Liang et al., Science . 274:1520-1522, 1996; Ding et al., Adv. Expt. Med. Biol. . 

20 376:261-269, 1995; each of which is incorporated herein by reference); a lipoprotein 
library (de Kruif et al., FEBS Lett. . 399:232-236, 1996, which is incorporated herein 
by reference); a glycoprotein or glycolipid library (Karaoglu et al., J. Cell Biol. . 
130:567-577, 1995, which is incorporated herein by reference); or a chemical library 
containing, for example, drugs or other pharmaceutical agents (Gordon et al., 

25 J. Med. Chem. . 37:1385-1401, 1994: Ecker and Crooke. Bio/Technology . 13:351-360, 
1995; each of which is incorporated herein by reference). Polynucleotides can be 
particularly useful as agents that can modulate a specific interaction of myostatin and 
its receptor because nucleic acid molecules having binding specificity for cellular 
targets, including cellular polypeptides, exist naturally, and because synthetic 

30 molecules having such specificity can be readily prepared and identified (see, for 
example, U.S. Patent No. 5,750,342, which is incorporated herein by reference). 
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In view of the present disclosure, it will be recognized that various animal 
model systems can be used as research tools to identify agents useful for practicing a 
method of the invention. For example, transgenic mice or other experimental animals 
can be prepared using the various myostatin inhibitor constructs disclosed herein, and 
5 the transgenic non-human organism can be examined directly to determine the effect 
produced by expressing various levels of a particular agent in the organism. In 
addition, the transgenic organism, for example, a transgenic mouse, can be crossbred 
with other mice, for example, with ob/ob, db/db, or agouti lethal yellow mutant mice, 
to determine optimal levels of expression of a myostatin inhibitor useful for treating 
10 or preventing a disorder such as obesity, type II diabetes, or the like. As such, the 
present invention provides transgenic non-human organisms, particularly transgenic 
organisms containing a polynucleotide encoding a myostatin prodomain, which can 
include the myostatin signal peptide, or a polynucleotide encoding a mutant 
promyostatin polypeptide. 

15 

Various methods are known for producing a transgenic animal. In one 
method, an embryo at the pronuclear stage (a "one cell embryo") is harvested from a 
female and the transgene is microinjected into the embryo, in which case the 
transgene will be chromosomally integrated into the germ cells and somatic cells of 

20 the resulting mature animal. In another method, embryonic stem cells are isolated and 
the transgene is incorporated into the stem cells by electroporation, plasmid 
transfection or microinjection; the stem cells are then reintroduced into the embryo, 
where they colonize and contribute to the germ line. Methods for microinjection of 
polynucleotides into mammalian species are described, for example, in U.S. Patent 

25 No. 4,873,1 91, which is incorporated herein by reference. In yet another method, 
embryonic cells are infected with a retrovirus containing the transgene, whereby the 
germ cells of the embryo have the transgene chromosomally integrated therein. 

When the animals to be made transgenic are avian, microinjection into the 
30 pronucleus of the fertilized egg is problematic because avian fertilized ova generally 
go through cell division for the first twenty hours in the oviduct and, therefore, the 
pronucleus is inaccessible. Thus, the retrovirus infection method is preferred for 
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making transgenic avian species (see U.S. Patent No. 5,162,215, which is 
incorporated herein by reference). If microinjection is to be used with avian species, 
however, the embryo can be obtained from a sacrificed hen approximately 2.5 hours 
after the laying of the previous laid egg, the transgene is microinjected into the 
5 cytoplasm of the germinal disc and the embryo is cultured in a host shell until 
maturity (Love et al., Biotechnology 12, 1994). When the animals to be made 
transgenic are bovine or porcine, microinjection can be hampered by the opacity of 
the ova,- thereby making the nuclei difficult to identify by traditional differential 
interference-contrast microscopy. To overcome this problem, the ova first can be 
10 centrifuged to segregate the pronuclei for better visualization. 

Non-human transgenic animals of the invention can be bovine, porcine, ovine, 
avian or other animals. The transgene can be introduced into embryonal target cells at 
various developmental stages, and different methods are selected depending on the 

1 5 stage of development of the embryonal target cell. The zygote is the best target for 
microinjection. The use of zygotes as a target for gene transfer has a major advantage 
in that the injected DNA can incorporate into the host gene before the first cleavage 
(Brinster et al., Proc. Natl. Acad. Sci.. USA 82:4438-4442, 1985). As a consequence, 
all cells of the transgenic non-human animal carry the incorporated transgene, thus 

20 contributing to efficient transmission of the transgene to offspring of the founder, 
since 50% of the germ cells will harbor the transgene. 

A transgenic animal can be produced by crossbreeding two chimeric animals, 
each of which includes exogenous genetic material within cells used in reproduction. 
25 Twenty-five percent of the resulting offspring will be transgenic animals that are 

homozygous for the exogenous genetic material, 50% of the resulting animals will be 
heterozygous, and the remaining 25% will lack the exogenous genetic material and 
have a wild type phenotype. 

30 In the microinjection method, the transgene is digested and purified free from 

any vector DNA, for example, by gel electrophoresis. The transgene can include an 
operatively associated promoter, which interacts with cellular proteins involved in 
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transcription, and provides for constitutive expression, tissue specific expression, 
developmental stage specific expression, or the like. Such promoters include those 
from cytomegalovirus (CMV), Moloney leukemia virus (MLV), and herpes virus, as 
well as those from the genes encoding metallothionein, skeletal actin, 
5 Phosphenolpyruvate carboxylase (PEPCK), phosphoglycerate (PGK), dihydrofolate 
reductase (DHFR), and thymidine kinase (TK). Promoters from viral long terminal 
repeats (LTRs) such as Rous sarcoma virus LTR also can be employed. When the 
animals to be made transgenic are avian, preferred promoters include those for the 
chicken (3-globin gene, chicken lysozyme gene, and avian leukosis virus. Constructs 
1 0 useful in plasmid transfection of embryonic stem cells will employ additional 
regulatory elements, including, for example, enhancer elements to stimulate 
transcription, splice acceptors, termination and polyadenylation signals, ribosome 
binding sites to permit translation, and the like. 

15 In the retroviral infection method, the developing non-human embryo can be 

cultured in vitro to the blastocyst stage. During this time, the blastomeres can be 
targets for retroviral infection (Jaenich, Proc. Natl. Acad. Sci. USA 73:1260-1264, 
1 976). Efficient infection of the blastomeres is obtained by enzymatic treatment to 
remove the zona pellucida (Hogan et al., Manipulating the Mouse Embryo (Cold 

20 Spring Harbor Laboratory Press, 1986). The viral vector system used to introduce the 
transgene is typically a replication-defective retrovirus carrying the transgene (Jahner 
et al., Proc. Natl. Acad. Sci.. USA 82:6927-6931, 1985; Van der Putten et al., Proc. 
Natl. Acad. Sci. USA 82:6148-6152, 1985). Transfection is easily and efficiently 
obtained by culturing the blastomeres on a monolayer of virus producing cells (Van 

25 der Putten et al., supra, 1985; Stewart et al., EMBO J. 6:383-388, 1987). 

Alternatively, infection can be performed at a later stage. Virus or virus-producing 
cells can be injected into the blastocoele (Jahner et al., Nature 298:623-628, 1982). 
Most of the founders will be mosaic for the transgene since incorporation occurs only 
in a subset of the cells which formed the transgenic nonhuman animal. Further, the 

30 founder can contain various retroviral insertions of the transgene at different positions 
in the genome, which generally will segregate in the offspring. In addition, it is also 
possible to introduce transgenes into the germ line, albeit with low efficiency, by 
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intrauterine retroviral infection of the mid-gestation embryo (Jahner et al., 
supra, 1982). 

Embryonal stem cell (ES) also can be targeted for introduction of the 
transgene. ES cells are obtained from pre-implantation embryos cultured in vitro and 
fused with embryos (Evans et al. Nature 292:154-156, 1981; Bradley et al., Nature 
309:255-258, 1984; Gossler et al., Proc.Natl. Acad. Sci.. USA 83:9065-9069, 1986; 
Robertson et al., Nature 322:445-448, 1986). Transgenes can be efficiently 
introduced into the ES cells by DNA transfection or by retrovirus mediated 
transduction. Such transformed ES cells can thereafter be combined with blastocysts 
from a nonhuman animal. The ES cells thereafter colonize the embryo and contribute 
to the germ line of the resulting chimeric animal (see Jaenisch, Science 
240:1468-1474, 1988). 

As disclosed herein, myostatin can exert its activity, at least in part, through 
the Smad signal transduction pathway, and myostatin expression can be associated 
with various pathological conditions. As such, the present invention provides new 
targets for the treatment of various pathological conditions associated with myostatin, 
including metabolic conditions such as obesity and type II diabetes. Accordingly, the 
present invention provides methods for ameliorating the severity of a pathological 
condition in a subject, wherein the pathologic condition is characterized at least in 
part by an abnormal amount, development or metabolic activity of muscle or adipose 
tissue, by modulating myostatin signal transduction in a muscle cell or adipose tissue 
cell in the subject. 

Myostatin functions as a negative regulator of muscle growth (McPherron et 
al., supra, 1997). Myostatin knock-out mice weighed approximately 25% to 30% 
more than wild type littermates, and this increase in body weight in the mice 
examined resulted entirely from a dramatic increase in skeletal muscle tissue weight. 
In mice lacking myostatin, the skeletal muscles weighed about 2 to 3 times as much as 
the corresponding muscles of wild type littermates. This increased muscle weight in 
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the homozygous knock-out mice resulted from a combination of hyperplasia and 
hypertrophy. 

As disclosed herein, heterozygous myostatin knock-out mice also have 
5 increased skeletal muscle mass, although to a lesser extent than that observed in 

homozygous mutant mice, thus demonstrating that myostatin acts in a dose-dependent 
manner in vivo (see Example 1). Furthermore, overexpression of myostatin in 
animals had the opposite effect with respect to muscle growth. For example, nude 
mice carrying myostatin-expressing tumors developed a wasting syndrome 
10 characterized by a dramatic loss of muscle and fat weight (see Example 8). This 

syndrome in the nude mice resembled the cachectic state that occurs in patients with 
chronic diseases such as cancer or AIDS. 

The serum levels of myostatin immunoreactive material have been correlated 
1 5 with the status of patients with respect to muscle wasting (Gonzalez-Kadavid et al., 
Proc. Natl. Acad. Med.. USA 95:14938-14943, 1998, which is incorporated herein by 
reference). Thus, patients with AIDS, who also showed signs of cachexia as 
measured by loss of total body weight, had slightly increased serum levels of 
myostatin immunoreactive material compared to either normal males without AIDS or 
20 to AIDS patients that did not have weight loss. However, the interpretation of these 
results was complicated because the myostatin immunoreactive material detected in 
the serum samples did not have the mobility on SDS gels that was expected for 
authentic processed myostatin. 

25 As disclosed herein, myostatin not only affects muscle mass, but also affects 

the overall metabolism of an organism. For example, myostatin is expressed in 
adipose tissue, and myostatin deficient mice have a dramatic reduction in fat 
accumulation as the animals age (see Examples II and III). Although no mechanism 
for myostatin action is proposed herein, the effect of myostatin can be direct effect of 

30 myostatin on adipose tissue, or can be an indirect effect caused by the lack of 

myostatin activity on skeletal muscle tissue. Regardless of the mechanism, the overall 
anabolic effect on muscle tissue that results in response to decreased myostatin 
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activitycan alter the overall metabolism of the organism and affect the storage of 
energy in the form of fat, as demonstrated by the introduction of a myostatin mutation 
into an obese mouse strain (agouti lethal yellow (A y ) mice), which suppressed fat 
accumulation by five-fold (see Example 5). Abnormal glucose metabolism also was 
5 partially suppressed in agouti mice containing the myostatin mutation. These results 
demonstrate that methods that inhibition of myostatin can be used to treat or prevent 
metabolic diseases such as obesity and type II diabetes. 

The methods of the invention are useful, for example, for ameliorating the 
10 severity of various pathologic conditions, including, for example, the cachexia 

associated with chronic diseases such as cancer (see Norton et al., Crit. Rev. Oncol. 
Hematol. 7:289-327, 1987), as well as conditions such as type II diabetes, obesity, and 
other metabolic disorders. As used herein, the term "pathologic condition" refers to a 
disorder that is characterized, at least in part, by an abnormal amount, development or 
1 5 metabolic activity of muscle or adipose tissue. Such pathologic conditions, which 
include,, for example, obesity; conditions associated with obesity, for example, 
atherosclerosis, hypertension, and myocardial infarction; muscle wasting disorders such 
as muscular dystrophy, neuromuscular diseases, cachexia, and anorexia; and metabolic 
disorders such as type II diabetes, which generally, but not necessarily, is associated with 
20 obesity, are particularly amenable to treatment using a method of the invention. 

As used herein, the term "abnormal," when used in reference to the amount, 
development or metabolic activity of muscle or adipose tissue, is used in a relative 
sense in comparison to an amount, development or metabolic activity that a skilled 

25 clinician or other relevant artisan would recognize as being normal or ideal. Such 
normal or ideal values are known to the clinician and are based on average values 
generally observed or desired in a healthy individual in a corresponding population. 
For example, the clinician would know that obesity is associated with a body weight 
that is about twenty percent above an "ideal" weight range for a person of a particular 

30 height and body type. However, the clinician would recognize that a body builder is 
not necessarily obese simply by virtue of having a body weight that is twenty percent 
or more above the weight expected for a person of the same height and body type in 
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an otherwise corresponding population. Similarly, the artisan would know that a 
patient presenting with what appears to abnormally decreased muscle activity could 
be identified as having abnormal muscle development, for example, by subjecting the 
patient to various strength tests and comparing the results with those expected for an 
5 average healthy individual in a corresponding population. 

A method of the invention can ameliorate the severity of a pathologic 
condition that is characterized, at least in part, by an abnormal amount, development 
or metabolic activity in muscle or adipose tissue, by modulating myostatin signal 

10 transduction in a muscle or adipose tissue cell associated with the etiology of the 
condition. As used herein, the term "ameliorate," when used in reference to the 
severity of a pathologic condition, means that signs or symptoms associated with the 
condition are lessened. The signs or symptoms to be monitored will be characteristic of 
a particular pathologic condition and will be well known to skilled clinician, as will the 

1 5 methods for monitoring the signs and conditions. For example, where the pathologic 

condition is type II diabetes, the skilled clinician can monitor the glucose levels, glucose 
clearance rates, and the like in the subject. Where the pathologic condition is obesity or 
cachexia, the clinician can simply monitor the subject's body weight. 

20 The agent to be administered to the subject is administered under conditions 

that facilitate contact of the agent with the target cell and, if appropriate, entry into the 
cell. Entry of a polynucleotide agent into a cell, for example, can be facilitated by 
incorporating the polynucleotide into a viral vector that can infect the cells. If a viral 
vector specific for the cell type is not available, the vector can be modified to express 

25 a receptor (or ligand) specific for a ligand (or receptor) expressed on the target cell, or 
can be encapsulated within a liposome, which also can be modified to include such a 
ligand (or receptor). A peptide agent can be introduced into a cell by various methods, 
including, for example, by engineering the peptide to contain a protein transduction 
domain such as the human immunodeficiency virus TAT protein transduction domain, 

30 which can facilitate translocation of the peptide into the cell (see Schwarze et al., supra, 
1999; Derossi etal, supra, 1996). 
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The presence of the agent in the target cell can be identified directly, for 
example, by operatively linking a detectable label to the agent, by using an antibody 
specific for the agent, particularly a peptide agent, or by detecting a downstream 
effect due to the agent, for example, decreased phosphorylation of an Smad 
5 polypeptide in the cell. An agent can be labeled so as to be detectable using methods 
well known in the art (Hermanson, "Bioconjugate Techniques" (Academic Press 
1996), which is incorporated herein by reference; see, also, Harlow and Lane, supra, 
1988). For example, a peptide or polynucleotide agent can be labeled with various 
detectable moieties including a radiolabel, an enzyme such as alkaline phosphatase, 
1 0 biotin, a fluorochrome, and the like. Where the agent is contained in a kit, the 

reagents for labeling the agent also can be included in the kit, or the reagents can be 
purchased separately from a commercial source. 

An agent useful in a method of the invention can be administered to the site of 
1 5 the pathologic condition, or can be administered by any method that provides the target 
cells with the polynucleotide or peptide. As used herein, the term "target cells" means 
muscle cells or adipocytes that are to be contacted with the agent. For administration to 
a living subject, the agent generally is formulated in a pharmaceutical composition 
suitable for administration to the subject. Thus, the invention provides pharmaceutical 
20 compositions containing an agent, which is useful for modulating myostatin signal 

transduction in a cell, in a pharmaceutically acceptable carrier. As such, the agents are 
useful as medicaments for treating a subject suffering from a pathological condition as 
defined herein. 

25 Pharmaceutically acceptable carriers are well known in the art and include, for 

example, aqueous solutions such as water or physiologically buffered saline or other 
solvents or vehicles such as glycols, glycerol, oils such as olive oil or injectable 
organic esters. A pharmaceutically acceptable carrier can contain physiologically 
acceptable compounds that act, for example, to stabilize or to increase the absorption 

30 of the conjugate. Such physiologically acceptable compounds include, for example, 
carbohydrates, such as glucose, sucrose or dextrans, antioxidants, such as ascorbic 
acid or glutathione, chelating agents, low molecular weight proteins or other 
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stabilizers or excipients. One skilled in the art would know that the choice of a 
pharmaceutically acceptable carrier, including a physiologically acceptable 
compound, depends, for example, on the physico-chemical characteristics of the 
therapeutic agent and on the route of administration of the composition, which can be, 
5 for example, orally or parenterally such as intravenously, and by injection, intubation, 
or other such method known in the art. The pharmaceutical composition also can 
contain a second reagent such as a diagnostic reagent, nutritional substance, toxin, or 
therapeutic agent, for example, a cancer chemotherapeutic agent. 

10 The agent can be incorporated within an encapsulating material such as into an 

oil-in-water emulsion, a microemulsion, micelle, mixed micelle, liposome, 
microsphere or other polymer matrix (see, for example, Gregoriadis, Liposome 
Technology . Vol. 1 (CRC Press, Boca Raton, FL 1984); Fraley, et al., Trends 
Biochem. Sci. . 6:77 (1981), each of which is incorporated herein by reference). 

15 Liposomes, for example, which consist of phospholipids or other lipids, are nontoxic, 
physiologically acceptable and metabolizable carriers that are relatively simple to 
make and administer. "Stealth" liposomes (see, for example, U.S. Patent 
Nos. 5,882,679; 5,395,619; and 5,225,212, each of which is incorporated herein by 
reference) are an example of such encapsulating materials particularly useful for 

20 preparing a pharmaceutical composition useful for practicing a method of the 
invention, and other "masked" liposomes similarly can be used, such liposomes 
extending the time that the therapeutic agent remain in the circulation. Cationic 
liposomes, for example, also can be modified with specific receptors or ligands 
(Morishita et al., J. Clin. Invest. . 91 :2580-2585 (1993), which is incorporated herein 

25 by reference). In addition, a polynucleotide agent can be introduced into a cell using, 
for example, adenovirus-polylysine DNA complexes (see, for example, Michael et al., 
J. Biol. Chem. 268:6866-6869 (1993), which is incorporated herein by reference). 

The route of administration of a pharmaceutical composition containing an 
30 agent that alters myostatin signal transduction will depend, in part, on the chemical 
structure of the molecule. Polypeptides and polynucleotides, for example, are not 
particularly useful when administered orally because they can be degraded in the 
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digestive tract. However, methods for chemically modifying polypeptides, for 
example, to render them less susceptible to degradation by endogenous proteases or 
more absorbable through the alimentary tract are well known (see, for example, 
Blondelle et al., supra, 1995; Ecker and Crook, supra, 1995). In addition, a peptide 
5 agent can be prepared using D-amino acids, or can contain one or more domains 
based on peptidomimetics, which are organic molecules that mimic the structure of 
peptide domain; or based on a peptoid such as a vinylogous peptoid. 

A pharmaceutical composition as disclosed herein can be administered to an 
10 individual by various routes including, for example, orally or parenterally, such as 
intravenously, intramuscularly, subcutaneously, intraorbitally, intracapsularly, 
intraperitoneally, intrarectally, intracisternally or by passive or facilitated absorption 
through the skin using, for example, a skin patch or transdermal iontophoresis, 
respectively. Furthermore, the pharmaceutical composition can be administered by 
1 5 injection, intubation, orally or topically, the latter of which can be passive, for 

example, by direct application of an ointment, or active, for example, using a nasal 
spray or inhalant, in which case one component of the composition is an appropriate 
propellant. A pharmaceutical composition also can be administered to the site of a 
pathologic condition, for example, intravenously or intra-arterially into a blood vessel 
20 supplying a tumor. 

The total amount of an agent to be administered in practicing a method of the 
invention can be administered to a subject as a single dose, either as a bolus or by 
infusion over a relatively short period of time, or can be administered using a 

25 fractionated treatment protocol, in which multiple doses are administered over a 

prolonged period of time. One skilled in the art would know that the amount of the 
pharmaceutical composition to treat a pathologic condition in a subject depends on 
many factors including the age and general health of the subject as well as the route of 
administration and the number of treatments to be administered. In view of these 

30 factors, the skilled artisan would adjust the particular dose as necessary. In general, 
the formulation of the pharmaceutical composition and the routes and frequency of 
administration are determined, initially, using Phase I and Phase II clinical trials. 
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The pharmaceutical composition can be formulated for oral formulation, such as 
a tablet, or a solution or suspension form; or can comprise an admixture with an organic 
or inorganic carrier or excipient suitable for enteral or parenteral applications, and can be 
5 compounded, for example, with the usual non-toxic, pharmaceutically acceptable 

carriers for tablets, pellets, capsules, suppositories, solutions, emulsions, suspensions, or 
other form suitable for use. The carriers, in addition to those disclosed above, can 
include glucose, lactose, mannose, gum acacia, gelatin, mannitol, starch paste, 
magnesium trisilicate, talc, corn starch, keratin, colloidal silica, potato starch, urea, 
1 0 medium chain length triglycerides, dextrans, and other carriers suitable for use in 

manufacturing preparations, in solid, semisolid, or liquid form. In addition auxiliary, 
stabilizing, thickening or coloring agents and perfumes can be used, for example a 
stabilizing dry agent such as triulose (see, for example, U.S. Patent No. 5,314,695). 

1 5 The present invention also provides a method of modulating the growth of 

muscle tissue or adipose tissue in a subject. As disclosed herein, GDF receptors such 
as Act RIIA and Act RIIB are involved in mediating the effects of a GDF such as 
myostatin, which is involved in muscle tissue and adipose tissue formation. Thus, in 
one embodiment, a method of modulating the growth of muscle tissue or adipose 

20 tissue includes affecting signal transduction from a GDF receptor, such as an activin 
receptor, e.g., Act RIIA or Act RIIB. Such a method can be performed by contacting 
cells of the tissue, or expressing in the cells, a mutant GDF receptor, which has 
dominant negative activity, constitutive activity, or the like. 

25 In another embodiment, a method of modulating the growth of muscle tissue 

or adipose tissue in an organism is performed by administering to the organism an 
agent that affects myostatin signal transduction. Preferably, the agent is or encodes a 
myostatin prodomain or a mutant promyostatin polypeptide, either of which can 
include a myostatin signal peptide. As used herein, the term "growth" is used in a 

30 relative sense in referring to the mass of muscle tissue or mass of adipose tissue in an 
organism that has been subjected to a method of the invention as compared to a 
corresponding organism that has not been subjected to a method of the invention. 
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Thus, where a method of the invention is performed such that myostatin signal 
transduction has been reduced or inhibited, it will be recognized that the growth of 
muscle tissue in the organism would result in an increased muscle mass in the 
organism as compared to the muscle mass of a corresponding organism (or population 
5 of organisms) in which myostatin signal transduction had not been so effected. 

A method of the invention can be useful for increasing the muscle mass or 
reducing the fat content of an organism or both. For example, where such a method is 
performed on an organism that is useful as a food source, the protein content of the 

10 food can be increased, the cholesterol level can be decreased, and the quality of the 
foodstuff can be improved. A method of the invention also can be useful for 
decreasing the growth of muscle tissue in an organism, for example, an organism that 
is detrimental to an environment, such that the organism is less able to compete in the 
environment. Thus, a method of the invention can be performed on any eukaryotic 

1 5 organism that expresses myostatin, including a vertebrate organism, for example, 
mammalian, avian or piscine organism, or can be an invertebrate organism, for 
example, a mollusk, echinoderm, gastropod or cephalopod. 

The agent can be any agent that alters myostatin signal transduction, as 
20 disclosed herein, and can be administered to the organism in any convenient manner. 
For example, where the organism to be treated are fish, shrimp, scallops, or the like, 
which are grown in aquaculture, the agent can be added to the water in which the 
organisms are maintained or can be included in their food, particularly where the 
agent is a soluble peptide or a small organic molecule. 

25 

Where the agent used in a method of the invention is a polynucleotide that 
encodes a peptide agent, an antisense agent, or the like, germ cells of a non-human 
organism containing the polynucleotide can be selected and transgenic organisms 
expressing the agent can be produced. Preferably, the polynucleotide is under the 
30 control of an inducible regulatory element, such that the agent encoded by the 

polynucleotide can be expressed at a time and for a duration as desired. Accordingly, 
the present invention provides transgenic nonhuman organisms, as well as food 
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products produced by these organisms. Such food products have increased nutritional 
value because of the increase in muscle tissue. The transgenic non-human animals 
can be any species as disclosed herein, including vertebrate organisms such as cattle, 
pigs, sheep, chicken, turkey and fish, and invertebrate species such as shrimp, lobster, 
5 crabs, squid, oysters and abalone. 

The regulation of TGF-(3 family members and their specific interactions with 
cell surface receptors are beginning to be elucidated. Thus, coexpression of the 
prodomain of a TGF-p family member with a mature region of another member of the 

10 TGF-P family is associated with intracellular dimerization and secretion of 

biologically active homodimers occur (Gray et al., Science 247:1328, 1990). For 
example, use of the BMP-2 prodomain with the BMP-4 mature region led to 
dramatically improved expression of mature BMP-4 (Hammonds et al., (Mol. 
Endocrinol. 5:149, 1991). For most of the family members that have been studied, the 

1 5 homodimeric species are biologically active, whereas for other family members such 
as the inhibins (Ling et al, Nature 321 :779, 1986) and the TGF-|3s (Cheifetz et al, 
Cell 48:409, 1987), heterodimers also have been detected and appear to have different 
biological properties than the respective homodimers. 

20 Receptor-ligand interaction studies have revealed a great deal of information 

as to how cells respond to external stimuli, and have led to the development of 
therapeutically important compounds such as erythropoietin, the colony stimulating 
factors, and PDGF. Thus, continual efforts have been made at identifying the 
receptors that mediate the action of the TGF-P family members. As disclosed herein, 

25 myostatin specifically interacts with an activin type II receptor. The identification of 
this interaction provides targets for identifying antagonists and agonists useful for 
agricultural and human therapeutic purposes, for example, for treating in various 
pathological conditions such as obesity, type II diabetes, and cachexia. The 
identification of this specific interaction also provides a means to identify other 

30 myostatin receptors, as well as the specific receptors of other growth differentiation 
factors. Accordingly, the present invention provides GDF receptors, which 
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specifically interact with a GDF or combination of GDFs, for example, with 
myostatin, GDF-11, or both. 

A GDF receptor of the invention is exemplified herein by a myostatin 
5 receptor, particularly an activin type II receptor, which specifically interacts with 
myostatin and with GDF-1 1 . However, myostatin receptors that specifically interact 
with myostatin, but not GDF-1 1, also are encompassed within the present invention, 
as are GDF-1 1 receptors that specifically interact with GDF-1 1 but not myostatin, and 
the like. For convenience of discussion, the receptors of the invention are referred to 

10 herein generally as a "GDF receptor" and are exemplified by a myostatin receptor, 
which is a receptor that specifically interacts at least with myostatin. As such, while 
reference is made generally to a specific interaction of myostatin with a myostatin 
receptor, it will be recognized that the present disclosure more broadly encompasses 
any GDF receptor, including a GDF-1 1 receptor, which specifically interacts at least 

15 with GDF-11. 

A recombinant cell line that expresses a GDF receptor polypeptide also is 
provided, as are antibodies that specifically bind the receptor, substantially purified 
polynucleotides that encode the receptor, and substantially purified GDF receptor 

20 polypeptides. Peptide portions of a GDF receptor also are provided, including, for 
example, soluble extracellular domains of a GDF receptor such as a myostatin 
receptor, which, as disclosed herein, can alter the specific interaction of myostatin 
with a cellular myostatin receptor; a constitutively active intracellular kinase domain 
of a GDF receptor, which can induce, stimulate or otherwise maintain GDF signal 

25 transduction in a cell; or other truncated portion of a GDF receptor having an ability 
to modulate myostatin or other GDF signal, transduction. 

The invention also provides methods for identifying a GDF receptor 
polypeptide, including methods of screening genomic or cDNA libraries, which can 
30 be expression libraries, using nucleotide probes or antibody probes; methods of 

screening cells that are responsive to and, therefore, express the receptor, using, for 
example, a GDF such as myostatin or a functional peptide portion thereof; two hybrid 



Gray Cary\GT\6209497. 1 
104659-160348 



70 



assays, as described above, using, for example, the GDF peptide as a component of 
one hybrid and peptides expressed from a cDNA library, which is prepared from cells 
expressing a receptor for the GDF, as components of the second hybrids, and the like. 

5 As described above, agents that specifically interact with a GDF receptor, for 

example, a myostatin receptor such as Act RIIB can be identified by using the 
receptor to screen for such agents. Conversely, an agent that has been identified as 
having the ability to specifically interact with a myostatin receptor such as the 
Act RIIB receptor, can be used to screen for additional myostatin receptors or other 

10 GDF receptors. Such a method can include incubating components such as the agent 
(or myostatin or other GDF) and a cell expressing a GDF receptor, which can be a 
truncated membrane bound receptor or a soluble receptor, under conditions sufficient 
to allow the agent (or GDF) to interact specifically with the receptor; measuring the 
agent (or GDF) bound to the receptor; and isolating the receptor. A method of 

1 5 molecular modeling as described above also can be useful as a screening method for 
identifying a GDF receptor, or a functional peptide portion thereof. 

Non-human transgenic animals that have a phenotype characterized by 
expression of a GDF receptor also are provided, the phenotype being conferred by a 
20 transgene contained in the somatic and germ cells of the animal. The transgene 
comprises a polynucleotide encoding the GDF receptor, for example, myostatin 
receptor, polypeptide. Methods of producing such transgenic animals are disclosed 
herein or otherwise known in the art. 

25 The present invention provides a substantially purified polynucleotide 

encoding all or a peptide portion of a GDF receptor. Although a GDF receptor is 
exemplified herein by an activin type II receptor, polynucleotides encoding activin 
type II receptors previously have been described (U.S. Patent No. 5,885,794). Thus, it 
should be recognized that such activin type II receptors are not encompassed within 

30 the present invention (Massague, supra, 1998; Heldin et al., supra, 1997). Similarly, 
activin type I receptors, including Act RIB; TGF-f3 receptors, including TGF-p RI and 
TGF-P RII; and BMP receptors, including BMP RIA, BMP RIB and BMP RII, have 
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been described and are well known in the art (Massague, supra, 1998; Heldin et al., 
supra, 1997) and, therefore, are not encompassed within the GDF receptors of the 
invention. 

5 A polynucleotide of the invention can encode a polypeptide having a 

myostatin receptor activity, for example, myostatin binding activity, or can encode a 
mutant myostatin receptor, for example, a mutant myostatin receptor having a 
mutation in a kinase domain, such that the mutant acts as a dominant negative 
myostatin receptor (see above). Thus, a polynucleotide of the invention can be a 

10 naturally occurring, synthetic, or intentionally manipulated polynucleotide. For 
example, portions of the mRNA sequence can be altered due to alternate RNA 
splicing patterns or the use of alternate promoters for RNA transcription. As another 
example, the polynucleotide can be subjected to site directed mutagenesis. The 
polynucleotide also can be antisense nucleotide sequence. GDF receptor 

1 5 polynucleotides of the invention include sequences that are degenerate as a result of 
the genetic code. There are 20 natural amino acids, most of which are specified by 
more than one codon. Therefore, all degenerate nucleotide sequences are included 
within the invention, provided the amino acid sequence of the GDF receptor 
polypeptide encoded by the polynucleotide is functionally unchanged. Also included 

20 are nucleotide sequences that encode myostatin receptor polypeptides. 

Oligonucleotide portions of a polynucleotide encoding a GDF receptor of the 
invention also are encompassed within the present invention. Such oligonucleotides 
generally are at least about 15 bases in length, which is sufficient to permit the 

25 oligonucleotide to selectively hybridize to a polynucleotide encoding the receptor, and 
can be at least about 18 nucleotides or 21 nucleotides or more in length. As used 
herein, the term "selective hybridization" or "selectively hybridize" refers to 
hybridization under moderately stringent or highly stringent physiological conditions, 
which can distinguish related nucleotide sequences from unrelated nucleotide 

30 sequences. 
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In nucleic acid hybridization reactions, the conditions used to achieve a 
particular level of stringency will vary, depending on the nature of the nucleic acids 
being hybridized. For example, the length, degree of complementarity, nucleotide 
sequence composition (for example, relative GC:AT content), and nucleic acid type, 
5 i.e., whether the oligonucleotide or the target nucleic acid sequence is DNA or RNA, 
can be considered in selecting hybridization conditions. An additional consideration 
is whether one of the nucleic acids is immobilized, for example, on a filter. Methods 
for selecting appropriate stringency conditions can be determined empirically or 
estimated using various formulas, and are well known in the art (see, for example, 
1 0 Sambrook et al. , supra, 1 989). 

An example of progressively higher stringency conditions is as follows: 
2X SSG/0.1% SDS at about room temperature (hybridization conditions); 
0.2X SSC/0.1% SDS at about room temperature (low stringency conditions); 
1 5 0.2X SSC/0. 1 % SDS at about 42°C (moderate stringency conditions); and 0. IX SSC 
at about 68°C (high stringency conditions). Washing can be carried out using only 
one of these conditions, for example, high stringency conditions, or each of the 
conditions can be used, for example, for 10 to 15 minutes each, in the order listed 
above, repeating any or all of the steps listed. 

20 

A GDF receptor-encoding polynucleotide of the invention can be obtained by 
any of several methods. For example, the polynucleotide can be isolated using 
hybridization or computer based techniques, as are well known in the art. These 
methods include, but are not limited to, 1) hybridization of genomic or cDNA 

25 libraries with probes to detect homologous nucleotide sequences; 2) antibody 
screening of expression libraries to detect cloned DNA fragments with shared 
structural features; 3) polymerase chain reaction (PCR) on genomic DNA or cDNA 
using primers capable of annealing to the DNA sequence of interest; 4) computer 
searches of sequence databases for similar sequences (see above); 5) differential 

30 screening of a subtracted DNA library; and 6) two hybrid assays using, for example, a 
mature GDF peptide in one of the hybrids. 
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In view of the present disclosure that an activin receptor specifically interacts 
with myostatin, oligonucleotide probes can be designed based on the sequence 
encoding an activin receptor, for example, a sequence encoding the extracellular 
domain, which binds myostatin, and used to screen a library prepared from cells such 
as muscle cells or adipocytes, which are responsive to myostatin, thus facilitating 
identification of a polynucleotide encoding a myostatin receptor. Selected clones can 
be further screened, for example, by subcloning the inserts into an expression vector 
and, following expression of the cloned sequences, screening the expressed 
polypeptides using myostatin. 

A polynucleotide of the invention, for example, a polynucleotide encoding a 
myostatin receptor, can be derived from a vertebrate species, including a mammalian, 
avian, or piscine species, or from an invertebrate species. Screening procedures that 
rely on nucleic acid hybridization allow the isolation any gene sequence from any 
organism, provided the appropriate probe is available. Oligonucleotide probes that 
correspond to a part of the sequence encoding the protein in question can be 
synthesized chemically. This requires that short, oligopeptide stretches of amino acid 
sequence are known. A polynucleotide sequence encoding the receptor can be 
deduced from the genetic code, taking into account the degeneracy of the genetic 
code. Thus, mixed addition reactions can be performed when the sequence is 
degenerate. This includes a heterogeneous mixture of denatured double stranded 
DNA. For such screening, hybridization is preferably performed on either single 
stranded DNA or denatured double stranded DNA. Hybridization is particularly 
useful in the detection of cDNA clones derived from sources where an extremely low 
amount of mRNA sequences relating to the polypeptide of interest are present. Thus, 
by using stringent hybridization conditions directed to avoid nonspecific binding, 
autoradiographic visualization can be used to identify a specific cDNA clone by the 
hybridization of the target DNA to an oligonucleotide probe in the mixture that is the 
complete complement of the target nucleic acid (Wallace et al., Nucl. Acid Res. . 
9:879, 1981, which is incorporated herein by reference). Alternatively, a subtractive 
library can be used, thereby eliminating nonspecific cDNA clones. 
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When the entire amino acid sequence of a desired polypeptide is not known, 
the direct synthesis of DNA sequences is not possible and the method of choice is the 
synthesis of cDNA sequences. Among the standard procedures for isolating cDNA 
sequences of interest is the formation of cDNA libraries prepared in plasmids or 
5 phage, wherein the libraries are derived from reverse transcription of mRNA that is 
abundant in donor cells having a high level of genetic expression. When used in 
combination with polymerase chain reaction technology, even rare expression 
products can be cloned. Where significant portions of the amino acid sequence of the 
polypeptide are known, the production of labeled single stranded or double stranded 
1 0 DNA or RNA probe sequences duplicating a sequence putatively present in the target 
cDNA can be employed in hybridization procedures carried out on cloned copies of 
the cDNA, which have been denatured into a single stranded form (Jay et al., Nucl. 
Acid Res. . 1 1:2325, 1983, which is incorporated herein by reference). 

1 5 A cDNA expression library, such as a lambda gtl 1 library, can be screened for 

GDF receptor peptides using an antibody specific for a GDF receptor, for example, an 
anti-Act RIIB antibody. The antibody can be polyclonal or monoclonal, and can be 
used to detect expression product indicative of the presence of a GDF receptor cDNA. 
Such an expression library also can be screened with a GDF peptide, for example, 

20 with myostatin, or a functional peptide portion thereof, to identify a clone encoding at 
least a portion of a myostatin binding domain of a myostatin receptor. 

Polynucleotides encoding mutant GDF receptors and mutant GDF receptor 
polypeptides are also encompassed within the invention. An alteration in a 

25 polynucleotide encoding a GDF receptor can be an intragenic mutation such as point 
mutation, nonsense (STOP) mutation, missense mutation, splice site mutation or 
frameshift, or can be a heterozygous or homozygous deletion, and can be a naturally 
occurring mutation or can be engineered using recombinant DNA methods, for 
example. Such alterations can be detected using standard methods known to those of 

30 skill in the art, including, but not limited to, nucleotide sequence analysis, Southern 
blot analysis, a PCR based analysis such as multiplex PCR or sequence tagged sites 
(STS) analysis, or in situ hybridization analysis. GDF receptor polypeptides can be 
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analyzed by standard SDS-PAGE, immunoprecipitation analysis, western blot 
analysis, or the like. Mutant GDF receptors are exemplified by truncated GDF 
receptors, including a soluble extracellular domain, which can have the ability to 
specifically bind its cognate GDF, but lacks a kinase domain; an intracellular GDF 
5 receptor kinase domain, which can exhibit constitutive kinase activity; as well as by 
GDF receptors that contain a point mutation, which disrupts the kinase activity of the 
receptor or the ligand binding ability of the receptor; and the like. Such GDF receptor 
mutants are useful for modulating GDF signal transduction and, therefore, for 
practicing various methods of the invention. 

10 

A polynucleotide encoding a GDF receptor can be expressed in vitro by 
introducing the polynucleotide into a suitable host cell. "Host cells" can be any cells 
in which the particular vector can be propagated, and, where appropriate, in which a 
polynucleotide contained in the vector can be expressed. The term "host cells" 
1 5 includes any progeny of an original host cell. It is understood that all progeny of the 
host cell may not be identical to the parental cell due, for example, to mutations that 
occur during replication. However, such progeny are included when the term "host 
cell" is used. Methods of obtaining a host cell that transiently or stably contains an 
introduced polynucleotide of the invention are well known in the art. 

20 

A GDF receptor polynucleotide of the invention can be inserted into a vector, 
which can be a cloning vector or a recombinant expression vector. The term 
"recombinant expression vector" refers to a plasmid, virus or other vehicle known in 
the art that has been manipulated by insertion or incorporation of a polynucleotide, 

25 particularly, with respect to the present invention, a polynucleotide encoding all or a 
peptide portion of a GDF receptor. Such expression vectors contain a promoter 
sequence, which facilitates the efficient transcription of the inserted genetic sequence 
of the host. The expression vector generally contains an origin of replication, a 
promoter, as well as specific genes which allow phenotypic selection of the 

30 transformed cells. Vectors suitable for use in the present invention include, but are 
not limited to, the T7-based expression vector for expression in bacteria (Rosenberg, 
et al., Gene 56:125, 1987), the pMSXND expression vector for expression in 
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mammalian cells (Lee and Nathans, J. Biol. Chem. 263:3521, 1988) and 
baculovirus-derived vectors for expression in insect cells. The DNA segment can be 
present in the vector operably linked to regulatory elements, for example, a promoter, 
which can be a T7 promoter, metallothionein I promoter, polyhedrin promoter, or 
5 other promoter as desired, particularly tissue specific promoters or inducible 
promoters. 

A polynucleotide sequence encoding a GDF receptor can be expressed in 
either prokaryotes or eukaryotes. Hosts can include microbial, yeast, insect and 

10 mammalian organisms. Methods of expressing polynucleotides having eukaryotic or 
viral sequences in prokaryotes are well known in the art, as are biologically functional 
viral and plasmid DNA vectors capable of expression and replication in a host. 
Methods for constructing an expression vector containing a polynucleotide of the 
invention are well known, as are factors to be considered in selecting transcriptional 

1 5 or translational control signals, including, for example, whether the polynucleotide is 
to be expressed preferentially in a particular cell type or under particular conditions 
(see, for example, Sambrook et al., supra, 1989). 

A variety of host cell/expression vector systems can be utilized to express a 
20 GDF receptor coding sequence, including, but not limited to, microorganisms such as 
bacteria transformed with recombinant bacteriophage DNA, plasmid DNA or cosmid 
DNA expression vectors; yeast cells transformed with recombinant yeast expression 
vectors; plant cell systems infected with recombinant virus expression vectors such as 
a cauliflower mosaic virus or tobacco mosaic virus, or transformed with recombinant 
25 plasmid expression vector such as a Ti plasmid; insect cells infected with recombinant 
virus expression vectors such as a baculovirus; animal cell systems infected with 
recombinant virus expression vectors such as a retrovirus, adenovirus or vaccinia 
virus vector; and transformed animal cell systems genetically engineered for stable 
expression. Where the expressed GDF receptor is post-translationally modified, for 
30 example, by glycosylation, it can be particularly advantageous to select a host 

cell/expression vector system that can effect the desired modification, for example, a 
mammalian host cell/expression vector system. 
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Depending on the host cell/vector system utilized, any of a number of suitable 
transcription and translation elements, including constitutive and inducible promoters, 
transcription enhancer elements, transcription terminators, and the like can be used in 
5 the expression vector (Bitter et al., Meth. Enzymol. 153:51 6-544, 1 987). For 
example, when cloning in bacterial systems, inducible promoters such as pL of 
bacteriophage X, plac, ptrp, ptac (ptrp-lac hybrid promoter) and the like can be used. 
When cloning in mammalian cell systems, promoters derived from the genome of 
mammalian cells, for example, a human or mouse metallothionein promoter, or from 
1 0 mammalian viruses, for example, a retrovirus long terminal repeat, an adenovirus late 
promoter or a vaccinia virus 7.5K promoter, can be used. Promoters produced by 
recombinant DNA or synthetic techniques can also be used to provide for 
transcription of the inserted GDF receptors coding sequence. 

15 In yeast cells, a number of vectors containing constitutive or inducible 

promoters can be used (see Ausubel et al., supra, 1987, see chapter 13; Grant et al., 
Meth. Enzvmol. 153:516-544, 1987; Glover, DNA Cloning Vol. II (IRL Press, 1986), 
see chapter 3; Bitter, Meth. Enzymol. 152:673-684, 1987; see, also, The Molecular 
Biology of the Yeast Saccharomyces (Eds., Strathern et al., Cold Spring Harbor 

20 Laboratory Press, 1982), Vols. I and II). A constitutive yeast promoter such as ADH 
or LEU2 or an inducible promoter such as GAL can be used (Rothstein, DNA 
Cloning Vol. II (supra, 1986), chapter 3). Alternatively, vectors can be used which 
promote integration of foreign DNA sequences into the yeast chromosome. 

25 Eukaryotic systems, particularly mammalian expression systems, allow for 

proper post-translational modifications of expressed mammalian proteins. Eukaryotic 
cells which possess the cellular machinery for proper processing of the primary 
transcript, glycosylation, phosphorylation, and advantageously, plasma membrane 
insertion of the gene product can be used as host cells for the expression of a GDF 

30 receptor polypeptide, or functional peptide portion thereof. 
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Mammalian cell systems which utilize recombinant viruses or viral elements 
to direct expression can be engineered. For example, when using adenovirus 
expression vectors, the GDF receptors coding sequence can be ligated to an 
adenovirus transcription/translation control complex, e.g., the late promoter and 
tripartite leader sequence. Alternatively, the vaccinia virus 7.5K promoter can be 
used (Mackett et al., Proc. Natl. Acad. Sci.. USA 79:7415-7419, 1982; Mackett et al, 
J, Virol. 49:857-864, 1984; Panicali et al, Proc. Natl. Acad. Sci.. USA 79:4927-4931, 
1982). Particularly useful are bovine papilloma virus vectors, which can replicate as 
extrachromosomal elements (Sarver et al., Mol. Cell. Biol. 1:486, 1981). Shortly after 
entry of this DNA into mouse cells, the plasmid replicates to about 100 to 200 copies 
per cell. Transcription of the inserted cDNA does not require integration of the 
plasmid into the host cell chromosome, thereby yielding a high level of expression. 
These vectors can be used for stable expression by including a selectable marker in 
the plasmid, such as, for example, the neo gene. Alternatively, the retroviral genome 
can be modified for use as a vector capable of introducing and directing the 
expression of the GDF receptors gene in host cells (Cone and Mulligan, Proc. Natl. 
Acad. Sci.. USA 81 -.6349-6353, 1984). High level expression can also be achieved 
using inducible promoters, including, but not limited to, the metallothionein IIA 
promoter and heat shock promoters. 

For long term, high yield production of recombinant proteins, stable 
expression is preferred. Rather than using expression vectors which contain viral 
origins of replication, host cells can be transformed with the GDF receptors cDNA 
controlled by appropriate expression control elements such as promoter, enhancer, 
sequences, transcription terminators, and polyadenylation sites, and a selectable 
marker. The selectable marker in the recombinant plasmid can confer resistance to 
the selection, and allows cells to stably integrate the plasmid into their chromosomes 
and grow to form foci, which, in turn can be cloned and expanded into cell lines. For 
example, following the introduction of foreign DNA, engineered cells can be allowed 
to grow for 1 to 2 days in an enriched media, and then are switched to a selective 
media. A number of selection systems can be used, including, but not limited to, the 
herpes simplex virus thymidine kinase (Wigler et al., Cell 1 1:223, 1977), 
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hypoxanthine-guanine phosphoribosyltransferase (Szybalska and Szybalski, Proc. 
Natl. Acad. Sci.. USA 48:2026, 1982), and adenine phosphoribosyltransferase (Lowy, 
et al., Cell 22:817, 1980) genes can be employed in tk", hgprt" or aprt" cells 
respectively. Also, antimetabolite resistance can be used as the basis of selection for 
5 dhfr, which confers resistance to methotrexate (Wigler, et al., Proc. Natl. Acad. Sci. 
USA 77:3567, 1980; O'Hare et al., Proc. Natl. Acad. Sci.. USA 78: 1527, 1981); gpt, 
which confers resistance to mycophenolic acid (Mulligan and Berg, Proc. Natl. Acad. 
Sci., USA 78:2072, 1981); neo, which confers resistance to the aminoglycoside G-418 
(Colberre-Garapin et al., J. Mol. Biol. 150:1, 1981); and hygro, which confers 

10 resistance to hygromycin (Santerre et al., Gene 30:147, 1984) genes. Additional 
selectable genes, including trpB, which allows cells to utilize indole in place of 
tryptophan; hisD, which allows cells to utilize histinol in place of histidine (Hartman 
and Mulligan, Proc. Natl. Acad. Sci.. USA 85:8047, 1988); and ODC (ornithine 
decarboxylase) which confers resistance to the ornithine decarboxylase inhibitor, 

1 5 2-(difluoromethyl)-DL-ornithine, DFMO (McConlogue, Curr. Comm. Mol. Biol. 
(Cold Spring Harbor Laboratory Press, 1987), also have been described. 

When the host is a eukaryote, such methods of transfection of DNA as calcium 
phosphate coprecipitates, conventional mechanical procedures such as microinjection, 

20 electroporation, insertion of a plasmid encased in liposomes, or virus vectors can be 
used. Eukaryotic cells can also be cotransformed with DNA sequences encoding the 
GDF receptors of the invention, and a second foreign DNA molecule encoding a 
selectable phenotype, such as the herpes simplex thymidine kinase gene. Another 
method is to use a eukaryotic viral vector, such as simian virus 40 (SV40) or bovine 

25 papilloma virus, to transiently infect or transform eukaryotic cells and express the 
protein. (Gluzman, Eukaryotic Viral Vectors (Cold Spring Harbor Laboratory Press, 
1982)).. 

The invention also provides stable recombinant cell lines, the cells of which 
30 express GDF receptor polypeptides and contain DNA that encodes GDF receptors. 
Suitable cell types include, but are not limited to, NIH 3T3 cells (murine), C2C12 
cells, L6 cells, and PI 9 cells. C2C12 and L6 myoblasts differentiate spontaneously in 
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culture and form myotubes depending on the particular growth conditions (Yaffe and 
Saxel, Nature 270:725-727, 1977; Yaffe, Proc. Natl. Ac ad. Sci.. USA 61:477-483, 
1968). P 19 is an embryonal carcinoma cell line. Such cells are described, for 
example, in the Cell Line Catalog of the American Type Culture Collection (ATCC). 
These cells can be stably transformed using well known methods (see, for example, 
Ausubel et al., supra, 1995, see sections 9.5.1-9.5.6). 

A GDF receptor can be expressed from a recombinant polynucleotide of the 
invention using inducible or constitutive regulatory elements, as described herein. 
The desired protein encoding sequence and an operably linked promoter can be 
introduced into a recipient cell either as a non-replicating DNA (or RNA) molecule, 
which can either be a linear molecule or a covalently closed circular molecule. 
Expression of the desired molecule can occur due to transient expression of the 
introduced sequence, or the polynucleotide can be stably maintained in the cell, for 
example, by integration into a host cell chromosome, thus allowing a more permanent 
expression. Accordingly, the cells can be stably or transiently transformed 
(transfected) cells. 

An example of a vector that can be employed is one which is capable of 
integrating the desired gene sequences into the host cell chromosome. Cells which 
have stably integrated the introduced DNA into their chromosomes can be selected by 
also introducing one or more markers which allow for selection of host cells which 
contain the expression vector. The marker can complement an auxotrophy in the host 
such as leu2, or ura3, which are common yeast auxotrophic markers; can confer a 
biocide resistance, for example, to an antibiotic or to heavy metal ions such as copper, 
or the like. The selectable marker gene can either be directly linked to the DNA gene 
sequences to be expressed, or can be introduced into the same cell by cotransfection. 

The introduced sequence can be incorporated into a plasmid or viral vector 
capable of autonomous replication in the recipient host. Any of a variety of vectors 
can be employed for this purpose. Factors of importance in selecting a particular 
plasmid or viral vector include the ease with which recipient cells that contain the 

Gray Cary\GT\6209497.1 
104659-160348 



81 



vector can be recognized and selected from those cells that do not contain the vector; 
the number of copies of the vector desired in a particular host cell; and whether it is 
desirable to be able to "shuttle" the vector between host cells of different species. 

For a mammalian host, several vector systems are available for expression. 
One class of vectors utilizes DNA elements that provide autonomously replicating 
extra-chromosomal plasmids derived from animal viruses, for example, a bovine 
papilloma virus, polyoma virus, adenovirus, or SV40 virus. A second class of vectors 
includes vaccinia virus expression vectors. A third class of vectors relies upon the 
integration of the desired gene sequences into the host chromosome. Cells that have 
stably integrated the introduced DNA into their chromosomes can be selected by also 
introducing one or more markers genes (as described above), which allow selection of 
host cells that contain the expression vector. The selectable marker gene can be 
directly linked to the DNA sequences to be expressed, or introduced into the same cell 
by cotransfection. Additional elements can be included to provide for optimal 
synthesis of an encoded mRNA or peptide, including, for example, splice signals, 
transcription promoters or enhancers, and transcription or translation termination 
signals. cDNA expression vectors incorporating appropriate regulatory elements are 
well known in the art (see, for example, Okayama, Mol. Cell. Biol. 3:280, 1983). 

Once the vector or DNA sequence containing the construct has been prepared 
for expression, the DNA construct can be introduced into an appropriate host. 
Various methods can be used for introducing the polynucleotide into a cell, including, 
for example, methods of transfection or transformation such as protoplast fusion, 
calcium phosphate precipitation, and electroporation or other conventional techniques, 
for example, infection where the vector is a viral vector. 

The invention also provides transgenic animals, which have cells that express 
a recombinant GDF receptor. Such transgenic animals can be selected to have 
decreased fat content or increased muscle mass, or both, and, therefore, can be useful 
as a source of food products with high muscle and protein content, and reduced fat 
and cholesterol content. The animals have been altered chromosomally in their germ 
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cells and somatic cells such that the production of a GDF, particularly myostatin, is 
maintained at a normal" level, but the myostatin receptor is produced in reduced 
amounts, or is completely disrupted, resulting in the cells in the animals having a 
decreased ability to bind myostatin and, consequently, having greater than normal 
5 levels of muscle tissue, preferably without increased fat or cholesterol levels. 
Accordingly, the present invention also includes food products provided by the 
animals. Such food products have increased nutritional value because of the increase 
in muscle tissue. The transgenic non-human animals of the invention include bovine, 
porcine, ovine and avian animals, as well as other vertebrates, and further includes 
10 transgenic invertebrates. 

The invention also provides a method of producing animal food products 
having increased muscle content. Such a method can include modifying the genetic 
makeup of the germ cells of a pronuclear embryo of the animal, implanting the 

1 5 embryo into the oviduct of a pseudopregnant female, thereby allowing the embryo to 
mature to full term progeny, testing the progeny for presence of the transgene to 
identify transgene positive progeny, crossbreeding transgene positive progeny to 
obtain further transgene positive progeny, and processing the progeny to obtain a 
foodstuff. The modification of the germ cell comprises altering the genetic 

20 composition so as to reduce or inhibit the expression of the naturally occurring gene 
encoding for production of a myostatin receptor protein. For example, the transgene 
can comprise an antisense molecule that is specific for a polynucleotide encoding a 
myostatin receptor; can comprise a non-functional sequence that replaces or 
intervenes in the endogenous myostatin receptor gene or the transgene; or can encode 

25 a myostatin receptor antagonist, for example, a dominant negative myostatin receptor 
such as a dominant negative Act RIIB. 

As used herein, the term "animal" refers to any bird, fish or mammal, except a 
human, and includes any stage of development, including embryonic and fetal stages. 
30 Farm animals such as pigs, goats, sheep, cows, horses, rabbits and the like; rodents 
such as mice; and domestic pets such as cats and dogs are included within the 
meaning of the term "animal." In addition, the term "organism" is used herein to 
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include animals as described above, as well as other eukaryotes, including, for 
example, other vertebrates such as reptiles and amphibians, as well as invertebrates as 
described above. 

As used herein, the term "transgenic," when used in reference to an animal or 
an organism, means that cells of the animal or organism have been genetically 
manipulated to contain an exogenous polynucleotide sequence that is stably 
maintained with the cells. The manipulation can be, for example, microinjection of a 
polynucleotide or infection with a recombinant virus containing the polynucleotide. 
Thus, the term "transgenic" is used herein to refer to animals (organisms) in which 
one or more cells receive a recombinant polynucleotide, which can be integrated into 
a chromosome in the cell, or can be maintained as an extrachromosomally replicating 
polynucleotide, such as might be engineered into a yeast artificial chromosome. The 
term "transgenic animal" also includes a "germ cell line" transgenic animal. A germ 
cell line transgenic animal is a transgenic animal in which the genetic information has 
been taken up and incorporated into a germ line cell, therefore conferring the ability 
to transfer the information to offspring. If such offspring in fact possess some or all 
of that information, the offspring also are considered to be transgenic animals. The 
invention further encompasses transgenic organisms. 

A transgenic organism can be any organism whose genome has been altered 
by in vitro manipulation of an early stage embryo or a fertilized egg, or by any 
transgenic technology to induce a specific gene knock-out. The term "gene 
knock-out" refers to the targeted disruption of a gene in a cell or in vivo that results in 
complete loss of function. A target gene in a transgenic animal can be rendered 
nonfunctional by an insertion targeted to the gene to be rendered nonfunctional, for 
example, by homologous recombination, or by any other method for disrupting the 
function of a gene in a cell. 

The transgene to be used in the practice of the subject invention can be a DNA 
sequence comprising a modified GDF receptors coding sequence. Preferably, the 
modified GDF receptor gene is one that is disrupted by homologous targeting in 
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embryonic stem cells. For example, the entire mature C-terminal region of the GDF 
receptors gene can be deleted (see Example 13). Optionally, the disruption (or 
deletion) can be accompanied by insertion of or replacement with another 
polynucleotide, for example, a nonfunctional GDF receptor sequence. A "knock-out" 
5 phenotype also can be conferred by introducing or expressing an antisense GDF 
receptor polynucleotide in a cell in the organism, or by expressing an antibody or a 
dominant negative GDF receptor in the cells. Where appropriate, polynucleotides that 
encode proteins having GDF receptor activity, but that differ in nucleotide sequence 
from a naturally occurring GDF gene sequence due to the degeneracy of the genetic 
10 code, can be used herein, as can truncated forms, allelic variants and interspecies 
homologs. 

The present invention also provides antibodies that specifically bind a GDF 
receptor, and that block GDF binding to the receptor. Such antibodies can be useful, 
1 5 for example, for ameliorating a pathologic condition such as a cell proliferative 
disorder associated with muscle tissue. 

A monoclonal antibody that binds specifically to a GDF receptor, particularly 
to a myostatin receptor, can increase the development of skeletal muscles. In 

20 preferred embodiments of the claimed methods, a GDF receptor monoclonal antibody, 
polypeptide, or polynucleotide is administered to a patient suffering from a pathologic 
condition such as a muscle wasting disease, a neuromuscular disorder, muscle 
atrophy, aging, or the like. The GDF receptor antibody, particularly an anti-myostatin 
receptor antibody, can also be administered to a patient suffering from a pathologic 

25 condition such as a muscular dystrophy, spinal cord injury, traumatic injury, 
congestive obstructive pulmonary disease (COPD), AIDS or cachexia. 

In a preferred embodiment, the anti-myostatin receptor antibody is 
administered to a patient with muscle wasting disease or disorder by intravenous, 
30 intramuscular or subcutaneous injection; preferably, a monoclonal antibody is 

administered within a dose range between about 0.1 ug/kg to about 100 mg/kg; more 
preferably between about 1 u.g/kg to 75 mg/kg; most preferably from about 10 mg/kg 
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to 50 mg/kg. The antibody can be administered, for example, by bolus injunction or 
by slow infusion. Slow infusion over a period of 30 minutes to 2 hours is preferred. 
The anti-myostatin receptor antibody, or other anti-GDF receptor antibody, can be 
formulated in a formulation suitable for administration to a patient. Such 
formulations are known in the art. 

The dosage regimen will be determined by the attending physician considering 
various factors which modify the action of the myostatin receptor protein, for 
example, amount of tissue desired to be formed, the site of tissue damage, the 
condition of the damaged tissue, the size of a wound, type of damaged tissue, the 
patient's age, sex, and diet, the severity of any infection, time of administration and 
other clinical factors. The dosage can vary with the type of matrix used in the 
reconstitution and the types of agent, such as anti-myostatin receptor antibodies, to be 
used in the composition. Generally, systemic or injectable administration, such as 
intravenous, intramuscular or subcutaneous injection. Administration generally is 
initiated at a dose which is minimally effective, and the dose is increased over a 
preselected time course until a positive effect is observed. Subsequently, incremental 
increases in dosage are made limiting such incremental increases to such levels that 
produce a corresponding increase in effect, while taking into account any adverse 
affects that can appear. The addition of other known growth factors, such as IGF I 
(insulin like growth factor I), human, bovine, or chicken growth hormone, which can 
aid in increasing muscle mass, to the final composition, can also affect the dosage. In 
the embodiment where an anti-myostatin receptor antibody is administered, the 
antibody is generally administered within a dose range of about 0.1 p.g/kg to about 
100 mg/kg.; more preferably between about 10 mg/kg to 50 mg/kg. 

As used herein, the term "antibody" is used in its broadest sense to include 
polyclonal and monoclonal antibodies, as well as antigen binding fragments of such 
antibodies. An antibody useful in a method of the invention, or an antigen binding 
fragment thereof, is characterized, for example, by having specific binding activity for 
an epitope of a GDF receptor, for example, a myostatin receptor. In addition, as 
discussed above, an antibody of the invention can be an antibody that specifically 
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binds a peptide portion of a promyostatin polypeptide, particularly a myostatin 
prodomain or functional peptide portion thereof. It will be recognized that the 
following methods, which exemplify the preparation and characterization of GDF 
receptor antibodies, further are applicable to the preparation and characterization of 
5 additional antibodies of the invention, including antibodies that specifically bind a 
myostatin prodomain, antibodies that specifically bind a promyostatin polypeptide 
and reduce or inhibit proteolytic cleavage of the promyostatin to myostatin, and the 
like 

10 The term "binds specifically" or "specific binding activity," when used in 

reference to an antibody means that an interaction of the antibody and a particular 
epitope has a dissociation constant of at least about 1 x 10" 6 , generally at least about 
1 x 10" 7 , usually at least about 1 x 10" 8 , and particularly at least about 1 x 10" 9 or 
1 x 10" 10 or less. As such, Fab, F(ab')2, Fd and Fv fragments of an antibody that retain 

1 5 specific binding activity for an epitope of a GDF receptor, are included within the 
definition of an antibody. For purposes of the present invention, an antibody that 
reacts specifically with an epitope of a myostatin receptor, for example, is considered 
to not substantially react with a TGF-f} receptor or a BMP receptor if the antibody has 
at least a two-fold greater binding affinity, generally at least a five-fold greater 

20 binding affinity, and particularly at least a ten-fold greater binding affinity for the 
myostatin receptor as compared to the TGF-(3 or BMP receptor. 

The term "antibody" as used herein includes naturally occurring antibodies as 
well as non-naturally occurring antibodies, including, for example, single chain 
antibodies, chimeric, bifunctional and humanized antibodies, as well as antigen- 

25 binding fragments thereof. Such non-naturally occurring antibodies can be 

constructed using solid phase peptide synthesis, can be produced recombinantly or 
can be obtained, for example, by screening combinatorial libraries consisting of 
variable heavy chains and variable light chains (see Huse et al., Science 
246:1275-1281 (1989), which is incorporated herein by reference). These and other 

30 methods of making, for example, chimeric, humanized, CDR-grafted, single chain, 
and bifunctional antibodies are well known to those skilled in the art (Winter and 
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Harris, Immunol. Today 14:243-246, 1993; Ward et al., Nature 341:544-546, 1989; 
Harlow and Lane, Antibodies: A laboratory manual (Cold Spring Harbor Laboratory 
Press, 1-988); Hilyard et al, Protein Engineering: A practical approach (IRL Press 
1992); Borrabeck, Antibody Engineering . 2d ed. (Oxford University Press 1995); 
each of which is incorporated herein by reference). 

Antibodies that bind specifically with a GDF receptor can be raised using the 
receptor as an immunogen and removing antibodies that crossreact, for example, with 
a TGF-p type I or type II receptor, with an activin receptor such as Act RIB, Act RIIA 
or Act RIIB, or a BMP receptors such as BMP RII, BMP RIA and BMP RIB (see 
Massague, supra, 1998). An antibody of the invention conveniently can be raised 
using a peptide portion of a myostatin receptor that is not present in a TGF-P, activin, 
or BMP receptor. Similarly, an antibody that specifically binds a myostatin 
prodomain can be raised using the prodomain, or a functional peptide portion thereof 
as the immunogen. Where such a peptide is non-immunogenic, it can be made 
immunogenic by coupling the hapten to a carrier molecule such as bovine serum 
albumin (BSA) or keyhole limpet hemocyanin (KLH), or by expressing the peptide 
portion as a fusion protein. Various other carrier molecules and methods for coupling 
a hapten to a carrier molecule are well known in the art (see, for example, by Harlow 
and Lane, supra, 1988). 

If desired, a kit incorporating an antibody or other agent useful in a method of 
the invention can be prepared. Such a kit can contain, in addition to the agent, a 
pharmaceutical composition in which the agent can be reconstituted for administration 
to a subject. The kit also can contain, for example, reagents for detecting the 
antibody, or for detecting specific binding of the antibody to a GDF receptor. Such 
detectable reagents useful for labeling or otherwise identifying the antibody are 
described herein and known in the art. 

Methods for raising polyclonal antibodies, for example, in a rabbit, goat, 
mouse or other mammal, are well known in the art (see, for example, Green et al., 
"Production of Polyclonal Antisera," in Immunoc hemical Protocols (Manson, ed., 
Humana Press 1992), pages 1-5; Coligan et al., "Production of Polyclonal Antisera in 
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Rabbits, Rats, Mice and Hamsters," in Curr. Protocols Immunol. (1992). section 2.4.1; 
each or which is incorporated herein by reference). In addition, monoclonal 
antibodies can be obtained using methods that are well known and routine in the art 
(Harlow and Lane, supra, 1988). For example, spleen cells from a mouse immunized 
5 with a myostatin receptor, or an epitopic fragment thereof, can be fused to an 

appropriate myeloma cell line such as SP/02 myeloma cells to produce hybridoma 
cells. Cloned hybridoma cell lines can be screened using labeled antigen to identify 
clones that secrete monoclonal antibodies having the appropriate specificity, and 
hybridomas expressing antibodies having a desirable specificity and affinity can be 

1 0 isolated and utilized as a continuous source of the antibodies. The antibodies can be 
further screened for the inability to bind specifically with the myostatin receptor. 
Such antibodies are useful, for example, for preparing standardized kits for clinical 
use. A recombinant phage that expresses, for example, a single chain anti-myostatin 
receptor antibody also provides an antibody that can used for preparing standardized 

1 5 kits. 

Methods of preparing monoclonal antibodies well known (see, for example, 
Kohler and Milstein, Nature 256:495, 1975, which is incorporated herein by 
reference; see, also, Coligan et al., supra, 1992, see sections 2.5.1-2.6.7; Harlow and 
Lane, supra, 1988). Briefly, monoclonal antibodies can be obtained by injecting mice 
20 with a composition comprising an antigen, verifying the presence of antibody 
production by removing a serum sample, removing the spleen to obtain 
B lymphocytes, fusing the B lymphocytes with myeloma cells to produce hybridomas, 
cloning the hybridomas, selecting positive clones that produce antibodies to the 
antigen, and isolating the antibodies from the hybridoma cultures. 

25 

Monoclonal antibodies can be isolated and purified from hybridoma cultures 
by a variety of well established techniques, including, for example, affinity 
chromatography with Protein-A SEPHAROSE, size exclusion chromatography, and 
ion exchange chromatography (Coligan et al., supra, 1992, see sections 2.7.1-2.7.12 
30 and sections 2.9. 1-2.9.3; see, also, Barnes et al., "Purification of Immunoglobulin G 
(IgG)," in Meth. :Molec. Biol. 10:79-104 (Humana Press 1992), which is incorporated 
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herein by reference). Methods of in vitro and in vivo multiplication of monoclonal 
antibodies is well known to those skilled in the art. Multiplication in vitro can be 
carried out in suitable culture media such as Dulbecco's Modified Eagle Medium or 
RPMI 1640 medium, optionally replenished by a mammalian serum such as fetal calf 
5 serum or trace elements and growth sustaining supplements such as normal mouse 
peritoneal exudate cells, spleen cells, bone marrow macrophages. Production in vitro 
provides relatively pure antibody preparations and allows scale-up to yield large 
amounts of the desired antibodies. Large scale hybridoma cultivation can be carried 
out by homogenous suspension culture in an airlift reactor, in a continuous stirrer 

1 0 reactor,, or in immobilized or entrapped cell culture. Multiplication in vivo can be 
carried out by injecting cell clones into mammals histocompatible with the parent 
cells, for example, syngeneic mice, to cause growth of antibody producing tumors. 
Optionally, the animals are primed with a hydrocarbon, especially oils such as 
pristane (tetramethylpentadecane) prior to injection. After one to three weeks, the 

1 5 desired monoclonal antibody is recovered from the body fluid of the animal. 

Therapeutic applications for antibodies disclosed herein are also part of the 
present invention. For example, antibodies of the present invention can also be 
derived from subhuman primate antibody. General techniques for raising 
20 therapeutically useful antibodies in baboons can be found, for example, in Goldenberg 
et al., International Patent Publication WO 91/1 1465 (1991); and Losman et al., Int. J. 
Cancer 46:3 10, 1990, each of which is incorporated herein by reference. 

A therapeutically useful anti-GDF receptor antibody also can be derived from 
25 a "humanized" monoclonal antibody. Humanized monoclonal antibodies are 

produced by transferring mouse complementarity determining regions from heavy and 
light variable chains of the mouse immunoglobulin into a human variable domain, and 
then substituting human residues in the framework regions of the murine counterparts. 
The use of antibody components derived from humanized monoclonal antibodies 
30 obviates potential problems associated with the immunogenicity of murine constant 
regions: General techniques for cloning murine immunoglobulin variable domains 
are known (see, for example, Orlandi et al., Proc. Natl. Acad. Sci.. USA 86:3833, 
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1989, which is hereby incorporated in its entirety by reference). Techniques for 
producing humanized monoclonal antibodies also are known (see, for example, Jones 
et al., Nature 321 :522, 1986; Riechmann et al., Nature 332:323, 1988; Verhoeyen et 
al, Science 239:1534, 1988; Carter et al., Proc. Natl. Acad. Sci.. USA 89:4285, 1992; 
Sandhu, Pit. Rev. Biotechnol. 12:437, 1992; and Singer et al., J. Immunol. 150:2844, 
1993; each of which is incorporated herein by reference). 

Antibodies of the invention also can be derived from human antibody 
fragments isolated from a combinatorial immunoglobulin library (see, for example, 
Barbas et al., METHODS: A Companion to Methods in Immunology 2: 1 1 9, 1 99 1 ; 
Winter et al., Ann. Rev. Immunol. 12:433, 1994; each of which is incorporated herein 
by reference). Cloning and expression vectors that are useful for producing a human 
immunoglobulin phage library can be obtained, for example, from STRATAGENE 
Cloning Systems (La Jolla, CA). 

An antibody of the invention also can be derived from a human monoclonal 
antibody. Such antibodies are obtained from transgenic mice that have been 
"engineered" to produce specific human antibodies in response to antigenic challenge. 
In this technique, elements of the human heavy and light chain loci are introduced 
into strains of mice derived from embryonic stem cell lines that contain targeted 
disruptions of the endogenous heavy and light chain loci. The transgenic mice can 
synthesize human antibodies specific for human antigens, and the mice can be used to 
produce human antibody-secreting hybridomas. Methods for obtaining human 
antibodies from transgenic mice are described, for example, by Green et al., Nature 
Genet. 7:13, 1994; Lonberg et al., Nature 368:856, 1994; and Taylor et al., Int 
Immunol. 6:579, 1994; each of which is incorporated herein by reference. 

Antibody fragments of the present invention can be prepared by proteolytic 
hydrolysis of the antibody or by expression in E. coli of DNA encoding the fragment. 
Antibody fragments can be obtained by pepsin or papain digestion of whole 
antibodies by conventional methods. For example, antibody fragments can be 
produced by enzymatic cleavage of antibodies with pepsin to provide a 5S fragment 
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denoted F(ab')2- This fragment can be further cleaved using a thiol reducing agent, 
and optionally a blocking group for the sulfhydryl groups resulting from cleavage of 
disulfide linkages, to produce 3.5S Fab' monovalent fragments. Alternatively, an 
enzymatic cleavage using pepsin produces two monovalent Fab' fragments and an Fc 
fragment directly (see, for example, Goldenberg, U.S. Patent No. 4,036,945 and U.S. 
Patent No. 4,331,647, each of which is incorporated by reference, and references 
contained therein; Nisonhoff et al., Arch. Biochem. Biophvs . 89:230. 1960; Porter, 
Biochem. J. 73:1 19, 1959; Edelman et al., Meth. EnzvmoL . 1:422 (Academic Press 
1967), each of which is incorporated herein by reference; see, also, Coligan et al., 
supra, 1992, see sections 2.8.1-2.8.10 and 2.10.1-2.10.4). 

Other methods of cleaving antibodies, such as separation of heavy chains to 
form monovalent light/heavy chain fragments, further cleavage of fragments, or other 
enzymatic, chemical, or genetic techniques can also be used, provided the fragments 
specifically bind to the antigen that is recognized by the intact antibody. For example, 
Fv fragments comprise an association of V H and V L chains. This association can be 
noncovalent (Inbar et al., Proc. Natl. Acad. Sci.. USA 69:2659, 1972). Alternatively, 
the variable chains can be linked by an intermolecular disulfide bond or cross-linked 
by chemicals such as glutaraldehyde ( Sandhu, supra, 1992). Preferably, the 
Fv fragments comprise V H and V L chains connected by a peptide linker. These 
single-chain antigen binding proteins (sFv) are prepared by constructing a structural 
gene comprising DNA sequences encoding the V H and V L domains connected by an 
oligonucleotide. The structural gene is inserted into an expression vector, which is 
subsequently introduced into a host cell such as E. coli. The recombinant host cells 
synthesize a single polypeptide chain with a linker peptide bridging the two 
V domains. Methods for producing sFvs are described, for example, by Whitlow et 
al., Methods: A Companion to Methods in Enzymology 2:97, 1991; Bird et al., 
Science 242:423-426, 1988; Ladner et al., U.S. patent No. 4,946,778; Pack et al., 
Bio/Technology 1 1 : 1271 -1277, 1993; each of which is incorporated herein by 
reference; see, also Sandhu, supra, 1992. 
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Another form of an antibody fragment is a peptide coding for a single 
complementarity-determining region (CDR). CDR peptides ("minimal recognition 
units") can be obtained by constructing genes encoding the CDR of an antibody of 
interest. Such genes are prepared, for example, by using the polymerase chain 
5 reaction to synthesize the variable region from RNA of antibody-producing cells (see, 
for example, Larrick et al., Methods: A Companion to Methods in Enzvmology 2:106, 
1991, which is incorporated herein by reference). 

The invention also provides a method for identifying a GDF receptor 
1 0 polypeptide. Such a method can be performed, for example, by incubating 

components comprising GDF polypeptide and a cell expressing a full length receptor 
or truncated receptor under conditions sufficient to allow the GDF to bind to the 
receptor; measuring the binding of the GDF polypeptide to the receptor; and isolating 
the receptor. The GDF can be any of the known GDFs (e.g., GDF-1-16), and 
1 5 preferably is GDF-8 (myostatin) or GDF- 1 1 . Methods of isolating the receptors are 
described in more detail in the Examples section below. Accordingly, the invention 
also provides a substantially purified GDF receptor, as well as peptides and peptide 
derivatives of a GDF receptor that have fewer amino acid residues than a naturally 
occurring GDF receptor. Such peptides and peptide derivatives are useful as research 
20 and diagnostic tools in the study of muscle wasting diseases and the development of 
more effective therapeutics. 

The invention further provides GDF receptor variants. As used herein, the 
term "GDF receptors variant" means a molecule that simulates at least part of the 

25 structure of GDF receptors. GDF receptor variants can be useful in reducing or 
inhibiting GDF binding, thereby ameliorating a pathologic condition as disclosed 
herein. Examples of GDF receptor variants include, but are not limited to, truncated 
GDF receptors such as a soluble extracellular domain of a GDF receptor; a dominant 
negative GDF receptor such as a dominant negative Act RIIB receptor, which lacks 

30 kinase activity; or other truncated or mutant GDF receptors. 
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The invention relates not only to peptides and peptide derivatives of a 
naturally-occurring GDF receptor, but also to GDF receptor variants, including 
mutants GDF receptors, and chemically synthesized derivatives of GDF receptors that 
specifically bind a GDF, for example, myostatin. For example, changes in the amino 
5 acid sequence of a GDF receptor are contemplated in the present invention. GDF 
receptors can be altered by changing the DNA encoding the protein. Preferably, only 
conservative amino acid alterations are undertaken, using amino acids that have the 
same or similar properties. Illustrative amino acid substitutions include the changes 
of alanine to serine; arginine to lysine; asparagine to glutamine or histidine; aspartate 

10 to glutamate; cysteine to serine; glutamine to asparagine; glutamate to aspartate; 
glycine to proline; histidine to asparagine or glutamine; isoleucine to leucine or 
valine; leucine to valine or isoleucine; lysine to arginine, glutamine, or glutamate; 
methionine to leucine or isoleucine; phenylalanine to tyrosine, leucine or methionine; 
serine to threonine; threonine to serine; tryptophan to tyrosine; tyrosine to tryptophan 

1 5 or phenylalanine; valine to isoleucine or leucine. 

Variants useful for the present invention comprise analogs, homologs, muteins 
and mimetics of a GDF receptor that retain the ability to specifically bind to their 
respective GDFs. In another embodiment, variant GDF receptors that have dominant 

20 negative activity also are contemplated, regardless of whether the variant also 

interacts specifically with its GDF. Peptides of the GDF receptors refer to portions of 
the amino acid sequence of GDF receptors that have these abilities. The variants can 
be generated directly from GDF receptors itself by chemical modification, by 
proteolytic enzyme digestion, or by combinations thereof. Additionally, genetic 

25 engineering techniques, as well as methods of synthesizing polypeptides directly from 
amino acid residues, can be employed. 

Peptides can be synthesized by such commonly used methods as t-BOC or 
FMOC protection of alpha-amino groups. Both methods involve stepwise syntheses 
30 whereby a single amino acid is added at each step starting from the C terminus of the 
peptide (Coligan, et al., Current Protocols in Immunology (Wiley Interscience, 1991), 
Unit 9, which is incorporated herein by reference). Peptides of the invention can also 
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be synthesized by the well known solid phase peptide synthesis methods (Merrifield, 
J. Am. Chem. Soc . 85:2149, 1962; Stewart and Young, Solid Phase Peptides 
Synthesis (Freeman, San Francisco, 1969), see pages 27-62, each of which is 
incorporated herein by reference), using a copoly(styrene-divinylbenzene) containing 
5 0. 1 -1 .0 mMol amines/g polymer. On completion of chemical synthesis, the peptides 
can be deprotected and cleaved from the polymer by treatment with liquid HF-10% 
anisole for about 1/4-1 hours at 0°C. After evaporation of the reagents, the peptides 
are extracted from the polymer with 1% acetic acid solution which is then lyophilized 
to yield the crude material. This can normally be purified by such techniques as gel 

1 0 filtration on Sephadex G-l 5 using 5% acetic acid as a solvent. Lyophilization of 
appropriate fractions of the column will yield the homogeneous peptide or peptide 
derivatives, which can then be characterized by such standard techniques as amino 
acid analysis, thin layer chromatography, high performance liquid chromatography, 
ultraviolet absorption spectroscopy, molar rotation, solubility, and quantitated by the 

1 5 solid phase Edman degradation. 

Non-peptide compounds that mimic the binding and function of GDF 
receptors ("mimetics") can be produced by the approach outlined by Saragovi et al. 
(Science 253: 792-95, 1991, which is incorporated herein by reference). Mimetics are 

20 molecules which mimic elements of protein secondary structure (Johnson et al., 
"Peptide Turn Mimetics," in Biotechnology and Pharmacy (Pezzuto et al., Eds.; 
Chapman and Hall, New York 1993), which is incorporated herein by reference). The 
underlying rationale behind the use of peptide mimetics is that the peptide backbone 
of proteins exists chiefly to orient amino acid side chains in such a way as to facilitate 

25 molecular interactions. For the purposes of the present invention, an appropriate 
mimetic can be considered to be the equivalent of a GDF receptor. 

Longer peptides can be produced by the "native chemical" ligation technique 
which links together peptides (Dawson et al., Science 266:776, 1994, which is 
30 incorporated herein by reference). Variants can be created by recombinant techniques 
employing genomic or cDNA cloning methods. Site specific and region directed 
mutagenesis techniques can be employed (Ausubel et al., supra, 1989 and 1990 to 
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1993 supplements), see volume 1, chapter 8; Protein Engineering (Oxender and Fox 
eds., A. Liss, Inc., 1987)). In addition, linker scanning and PCR mediated techniques 
can be employed for mutagenesis (Erlich, PCR Technology (Stockton Press 1989); 
Ausubel et al., supra, 1989 to 1993). Protein sequencing, structure and modeling 
5 approaches for use with any of the above techniques are disclosed in the above cited 
references. 

The present invention also provides GDF receptor-binding agents that block 
the specific binding of a GDF to its receptor. Such agents are useful, for example, as 

10 research and diagnostic tools in the study of muscle wasting disorder as described 
above and as effective therapeutics, and can be identified using the methods as 
disclosed herein, for example, a molecular modeling method. In addition, 
pharmaceutical compositions comprising GDF receptor-binding agents can represent 
effective therapeutics. In the context of the invention, the phrase "GDF receptor- 

1 5 binding agent" denotes a naturally occurring ligand of a GDF receptor, for example, 
GDF-1 to GDF-16; a synthetic ligand of GDF receptors, or an appropriate derivative 
of the natural or synthetic ligands. The determination and isolation of ligands is well 
known in the art (Lerner, Trends Neurosci. 17:142-146, 1994, which is incorporated 
herein by reference). 

20 

In yet another embodiment, the present invention relates to GDF receptor- 
binding agents that interfere with binding between a GDF receptor and a GDF. Such 
binding agents can interfere by competitive inhibition, by non-competitive inhibition 
or by uncompetitive inhibition. Interference with normal binding between GDF 
25 receptors and one or more GDF can result in a useful pharmacological effect. 

The invention also provides a method for identifying a composition that binds 
to a GDF receptor. The method includes incubating components comprising the 
composition and a GDF receptor under conditions sufficient to allow the components 
30 to interact specifically, and measuring the binding of the composition to GDF 
receptors. Compositions that bind to GDF receptors include peptides, 
peptidomimetics, polypeptides, chemical compounds and biologic agents as described 
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above. Incubating includes exposing the reactants to conditions that allow contact 
between the test composition and GDF receptors, and provide conditions suitable for 
a specific interaction as would occur in vivo. Contacting can be in solution or in solid 
phase. The test ligand/composition can optionally be a combinatorial library for 
5 screening a plurality of compositions, as described above. Compositions identified in 
the method of the invention can be further evaluated, detected, cloned, sequenced, and 
the like, either in solution or after binding to a solid support, by any method usually 
applied to the detection of a specific DNA sequence such as PCR, oligomer restriction 
(Saiki et al., Rio/Technologv 3:1008-1012, 1985, which is incorporated herein by 
1 0 reference), allele-specific oligonucleotide (ASO) probe analysis (Conner et al., Proc. 
Natl. Acad. Sci.. USA 80:278, 1983, which is incorporated herein by reference), 
oligonucleotide ligation assays (OLAs) (Landegren et al., Science 241:1077 1988, 
which is incorporated herein by reference), and the like (see Landegren et al., Science 
242:229-237, 1988. which is incorporated herein by reference). 

15 

To determine if a composition can functionally complex with the receptor 
protein, induction of an exogenous gene can be monitored by monitoring changes in 
the protein level of a protein encoded for by the exogenous gene, or any other method 
as disclosed herein. When a compositions is identified that can induce transcription 
20 of the exogenous gene, it is concluded that this composition can specifically bind to 
the receptor protein coded for by the nucleic acid encoding the initial sample test 
composition. 

Expression of the exogenous gene can be monitored by a functional assay or 
25 assay for a protein product, for example. The exogenous gene is therefore a gene that 
provides an assayable/measurable expression product in order to allow detection of 
expression of the exogenous gene. Such exogenous genes include, but are not limited 
to, reporter genes such as chloramphenicol acetyltransferase gene, an alkaline 
phosphatase gene, (3-galactosidase, a luciferase gene, a green fluorescent protein gene, 
30 guanine xanthine phosphoribosyltransf erase, alkaline phosphatase, and antibiotic 
resistance genes such as neomycin phosphotransferase (see above). 
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Expression of the exogenous gene is indicative of a specific interaction of a 
composition and a GDF receptor; thus, the binding or blocking composition can be 
identified and isolated. The compositions of the present invention can be extracted 
and purified from the culture media or a cell by using known protein purification 
5 techniques commonly employed, including, for example, extraction, precipitation, ion 
exchange chromatography, affinity chromatography, gel filtration and the like. 
Compositions can be isolated by affinity chromatography using the modified receptor 
protein extracellular domain bound to a column matrix or by heparin chromatography. 

1 0 Also included in the screening method of the invention is combinatorial 

chemistry methods for identifying chemical compounds that bind to GDF receptors, 
as described above. Thus, the screening method is also useful for identifying variants, 
binding or blocking agents, etc., which functionally, if not physically (e.g., sterically) 
act as antagonists or agonists, as desired. 

15 

The following examples are intended to illustrate but not limit the invention. 
EXAMPLE 1 

- MYOSTATIN ACTS IN A DOSE DEPENDENT MANNER 

20 

This example demonstrates that the activity of myostatin in inhibiting muscle 
growth is dependent on the level of myostatin expression in vivo. 

Myostatin is a negative regulator of skeletal muscle mass (McPherron et al., 
25 supra, 1 997; McPherron and Lee, supra, 1 997). Myostatin knock-out mice that were 
homozygous for a deletion of the myostatin gene had a 25-30% increase in total body 
mass. An examination of the homozygous knock-out mice revealed that the increased 
muscle mass was due to about a 100-200% increase in skeletal muscle mass 
throughout the body. 

30 

Mice that were heterozygous for the myostatin mutation also had an increase 
in total body mass. However, the increase mass of the heterozygotes was less than 
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that of the homozygotes, and was statistically significant in only one age and sex 
group among the many examined. In order to determine whether hetero2ygous mice 
have an intermediate phenotype between that of wild type mice and homo2ygous 
myostatin knock-out mice, the analysis of muscle weights was extended to the 
5 heterozygous mice. Individual muscles sampled from heterozygous mice weighed 
approximately 25-50% more than those of wild type mice. These results demonstrate 
that mice that are heterozygous for deletion of a myostatin gene have a phenotype that 
is intermediate between that of wild type mice and homozygous myostatin knock-out 
mice, and demonstrate that myostatin produces a dose-dependent effect in vivo. 

10 

These results indicate that the manipulation of myostatin activity can be useful 
in treating muscle wasting diseases and other metabolic disorders associated with 
myostatin activity. Furthermore, the dose-dependent effect of myostatin indicates that 
a therapeutic effect can be obtained without achieving complete inhibition of 
15 myostatin activity, thereby allowing for an adjustment of myostatin activity if, for 
example, a certain level of activity produces undesirable effects in a subject. 

EXAMPLE 2 

MYOSTATIN EFFECT DECREASES WTTH AGE IN KNOCK-OUT MICE 
20 This example demonstrates that a decreased difference in body weight 

between wild type mice and homozygous myostatin knock-out mice is associated with 
a decline in muscle weight of the mutant mice. 

Myostatin knock-out mice weighed approximately 25-30% more than wild 
25 type mice at five months of age (McPherron et al., supra, 1997). However, this 
difference in total body weights became significantly smaller or disappeared 
altogether as the animals aged. In order to determine whether this effect was due to a 
relative' loss of weight in the knock-out mice due, for example, to muscle 
degeneration, or to a relatively greater weight gain by wild type mice, a detailed 
30 analysis of muscle weights was made as a function of age. 
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At all ages examined from 2 months to 17 months, the pectoralis muscle 
weighed significantly more in homozygous mutant mice than in wild type littermates. 
The most dramatic difference was observed at 5 months, at which time the pectoralis 
weight was approximately 200% greater in the mutant mice. Although the pectoralis 
5 weight declined slightly at older ages, the weight of this muscle in mutant mice 
remained greater than twice that of wild type mice. This same basic trend was 
observed in all of the other muscles examined, including the triceps brachii, the 
quadriceps, the gastrocnemius and plantaris, and the tibialis anterior. Similar trends 
were observed in both male and female mice. These results demonstrate that the 
1 0 decreased difference in total body weights between mutant and wild type mice 

observed with aging is due to a slight decline in muscle weights in the mutant mice. 



EXAMPLE 3 

MYOSTATIN AFFECTS FAT ACCUMULATION 
15 IN A DOSE DEPENDENT MANNER 

This example demonstrates that myostatin knock-out mice fail to accumulate 
fat, and that the decrease in fat accumulation is associated with the level of myostatin 
expression in vivo. 

20 Since the decline in muscle weights in myostatin mutants, as demonstrated in 

Example 2, did not fully account for the observation that the wild type animals 
eventually weighed about the same as the mutant mice, the amount of fat 
accumulation in wild type and mutant mice was examined. The inguinal, epididymal 
and retroperitoneal fat pads in male mice were examined. There was no difference in 

25 the weights of any of these fat pads between wild type and mutant mice at two months 
of age. By 5 to 6 months of age, wild type and heterozygous knock-out mice both 
exhibited a large range of fat pad weights, and, on average, fad pad weights increased 
by approximately 3 -fold to 5-fold by the time the animals reached 9 to 10 months of 
age. Due to the large range of fat pad weights observed in these animals, some 

30 animals showed a much larger increase (up to 10-fold) than others. 
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In contrast to the wild type and heterozygous knock-out mice, the fat pad 
weights of myostatin homozygous mutant mice were in a relatively narrow range and 
were virtually identical in 2 month old mice and in 9 to 10 month old mice. Thus, the 
increased fat accumulation that occurred with aging in the wild type mice was not 
5 observed in the homozygous myostatin knock-out mice. This difference in fat 

accumulation, together with the slight decline in muscle weights, as a function of age 
in the homozygous mutant mice fully accounted for the observation that the wild type 
animals eventually have the same total body weight as the mutants. 

1 0 The mean fat pad weights of heterozygous knock-out mice at 9 to 1 0 months 

of age was intermediate between that of the wild type mice and the homozygous 
mutant mice. Although this difference was not statistically significant, due to the 
wide range of fat pad weights in these and the wild type mice, these results 
nevertheless indicate that myostatin has a dose-dependent effect on the accumulation 

1 5 of fat, similar to its effect on muscle growth. 

EXAMPLE 4 

EFFECT OF MYOSTATIN ON METABOLISM 
This example demonstrates that serum insulin and glucose levels, as well as 
20 metabolic activity, are affected by the level of myostatin expression. 

In order to determine whether the skeletal muscle hypertrophy and the lack of 
fat accumulation in the myostatin mutant mice is due to an effect on overall 
metabolism, the metabolic profile of the mutant mice was examined. Serum 

25 triglyceride and serum cholesterol levels were significantly lower in myostatin mutant 
mice as compared to wild type control mice (Table 1). Serum insulin levels also 
appeared to be lower in the myostatin mutant mice. However, the fed and fasting 
glucose levels both were indistinguishable among homozygous mutant mice and wild 
type mice (Table 1), and both groups of mice had a normal response in a glucose 

30 tolerance test. The results demonstrate that the homozygous myostatin knock-out 
mice can maintain normal levels of serum glucose even though their serum insulin 
level is lower than that of wild type animals. 
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Table 1. SERUM PARAMETERS 
+/+ -/- 

triglycerides (mg/dl) 131.5+/- 16.5 66.8+/- 11.4 p = 0.012 

cholesterol (mg/dl) 138.3+/- 8.1 94.5+/- 6.8 p = 0.0034 

fed glucose (mg/dl) 114.0+/- 4.8 119.3+/- 5.2 n.s.(p=0.43) 

fasting glucose (mg/dl) 86.5+/- 3.8 103.3 + 9.3 n.s. (p=0.13) 

+/+ indicates wild type mice; -/- indicate homozygous knock-out mice 

In order to determine whether differences in metabolic rates could explain the 
lack of fat accumulation in the mutant mice, the rate of oxygen consumption of wild 
type and mutant mice was compared using a calorimeter. Mutant mice had a lower 
basal metabolic rate and a lower overall metabolic rate than wild type control mice. 
These results indicate that the lack of fat accumulation in the myostatin mutant mice 
is not due to a higher rate of metabolic activity. 

EXAMPLE 5 

MYOSTATIN AFFECTS FAT ACCUMULATION 
IN GENETICALLY OBESE MICE 
This example demonstrates that a lack of myostatin expression suppresses fat 
accumulation in mice that are a genetic model for obesity. 

In order to determine whether the loss of myostatin activity could suppress fat 
accumulation not only in normal mice but also in obese mice, the effect of the 
myostatin mutation in agouti lethal yellow (A y ) mice, which represent a genetic model 
of obesity (Yen et al., FASEB J. 8:479-488, 1994), was examined. Mice that were 
doubly heterozygous for the lethal yellow and myostatin mutations were generated, 
and offspring from crosses of these doubly heterozygous mice were examined. 
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The total body weight of the A y /a, myostatin -/- double mutant mouse was 
dramatically reduced (approximately 9 grams) compared to that of the A y /a, myostatin 
+/+ mouse. This reduction in total body weight was even more dramatic considering 
that the A y /a, myostatin -/- double mutant had about 2 to 3 times more skeletal muscle 
5 than did the A Y /a, myostatin +/+ mouse. The double mutant had approximately 10 
grams more muscle than the A y /a, myostatin +/+ mouse and, therefore, the total 
weight reduction in the rest of the tissues was about 19 grams. 

The reduction in total body weight resulted from a reduction in overall fat 
1 0 content. As shown in Table 2, the weights of the parametrial and retroperitoneal fat 
pads were reduced 5 -fold to 6-fold in the A y /a, myostatin -/- double mutant as 
compared to the A y /a, myostatin +/+ mouse. These results indicate that the presence 
of the myostatin mutation dramatically suppresses fat accumulation in obesity. 

1 5 The presence of the myostatin mutation also dramatically affected glucose 

metabolism. Agouti lethal mice lacking the myostatin mutation had grossly abnormal 
glucose tolerance test results, with serum glucose levels often reaching 450 to 
600 mg/dl and only slowly recovering to baseline levels over a period of 4 hours. 
Female agouti lethal mice were affected less than male mice, and some females 

20 responded almost normally in this test, as previously described (see Yen et al., supra, 
1994). In contrast, although the A y /a, myostatin -/- mice had slightly abnormal 
glucose tolerance tests, but none of these animals had the gross abnormalities 
observed in the A y /a myostatin +/+ mice. 

25 These results indicate that the myostatin mutation at least partially suppressed 

the development of abnormal glucose metabolism in the agouti lethal mice. 
Significantly, mice that were heterozygous for the myostatin mutation had an 
intermediate response compared to myostatin +/+ and myostatin -/- mice, thus 
confirming the dose-dependent effect of myostatin. 
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EXAMPLE 6 

PURIFICATION OF RECOMBINANT MYOSTATIN 
This example provides a method for preparing and isolating recombinant 
myostatin. 

5 

In order to elucidate the biological activity of myostatin, large quantities of 
myostatin protein were purified for bioassays. Stable Chinese hamster ovary (CHO) 
cell lines producing high levels of myostatin protein were generated by co-amplifying 
a myostatin expression cassette with a dihydrofolate reductase cassette using a 

10 methotrexate selection scheme (McPherron et al., supra, 1997). Myostatin was 
purified from the conditioned medium of the highest producing line by successive 
fractionation on hydroxy apatite, lentil lectin SEPH AROSE, DEAE agarose, and 
heparin SEPHAROSE. Silver stain analysis revealed that the purified protein 
obtained following these four column chromatography steps (referred to as "heparin 

15 eluate") consisted of two species with molecular masses of approximately 
35 kilodaltons (kDa) and 12 kDa. 

The purified protein preparation was determined by various criteria to 
represent a complex of two myostatin prodomain peptides and a disulfide-linked 

20 dimer of mature C-terminal myostatin peptides. First, western blot analysis, using 

antibodies raised against specific portions of the promyostatin sequence, identified the 
35 kDa band as the prodomain and the 12 kDa band as the mature C-terminal peptide. 
Second, under non-reducing conditions, the species reacting with antibodies directed 
against the mature C-terminal peptide had an electrophoretic mobility consistent with 

25 a disulfide linked dimer. Third, the molar ratio of prodomain to mature C-terminal 
peptide was approximately 1:1. Fourth, the prodomain and mature C-terminal peptide 
copurified through the four column chromatography steps. Finally, the mature 
C-terminal peptide bound to the lentil lectin column even though the C-terminal 
region does not contain consensus N-linked glycosylation signals, indicating that the 

30 mature C-terminal peptide bound to the column due to its interaction with the 
prodomain peptide, which contains a potential N-linked glycosylation site. 
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These results indicate that myostatin produced by the genetically modified 
CHO cells is secreted in a proteolytically processed form, and that the resulting 
prodomain and mature C-terminal region associate non-covalently to form a complex 
containing two prodomain peptides and a disulfide-linked dimer of C-terminal 
5 proteolytic fragments, similar to that described for TGF-B. In the TGF-(3 complex, the 
C-terminal dimer exists in an inactive, latent form (Miyazono et al., J. Biol. Chem. 
263:6407-6415, 1988), and the active species can be released from this latent complex 
by treatment with acid, chaotropic agents, reactive oxygen species, or plasmin, or by 
interactions with other proteins, including thrombospondin and integrin avB6 

10 (Lawrence et al., Biochem. Biophys. Res. Comm. 133:1026-1034, 1985; Lyons et al, 
J. Cell Biol. 106:1659-1665, 1988; Schultz-Cherry and Murphy-Ullrich, J. Cell Biol. 
122:923-932, 1993: Barcellos-Hoff and Dix. Mol. Endocrinol. 10:1077-1083. 1996; 
Munger et al., Cell 96:3 19-328, 1999). Furthermore, the addition of purified 
prodomain peptide (also known as latency-associated peptide or LAP) to the TGF-p 

15 complex inhibits the biological activity of the purified C-terminal dimer in vitro and 
in vivo (Gentry and Nash, Biochemistry 29:6851-6857, 1990; Bottinger et al, Proc. 
Natl. Acad. Sci.. USA 93:5877-5882. 1996). 

The heparin eluate, which consisted of a complex of prodomain and mature 
20 C-terminal peptide, was further purified using an HPLC C4 reversed phase column. 
The C-terminal dimer eluted from the HPLC column earlier than the prodomain, thus 
allowing the isolation of the C-terminal dimer free of prodomain. Fractions that 
contained mostly prodomain also were obtained, although these fractions contained 
small amounts of the C-terminal dimer. Some of the protein also was present as 
25 higher molecular weight complexes. The nature of the higher molecular weight 

complexes is unknown, but, based on western blot analysis in the presence or absence 
of reducing agents, these complexes can contain at least one prodomain peptide and 
one C-terminal mature myostatin peptide linked by one or more disulfide bonds. In 
fact, most of the mature C-terminal peptide present in the HPLC fraction enriched for 
30 the propeptide (HPLC fractions 35-37) was present in these high molecular weight 
complexes. These higher molecular weight complexes likely represent improperly 
folded proteins that are secreted by the genetically modified CHO cells. 
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EXAMPLE 7 

MYOSTATIN SPECIFICALLY INTERACTS WTTH AN ACTTVIN RECEPTOR 
This example demonstrates that myostatin specifically binds an activin type II 
receptor expressed on cells in culture, and that this specific binding is inhibited by a 
5 myostatin prodomain. 

The receptors for some TGF-6 family members have been identified, and most 
are single membrane spanning serine/threonine kinases (Massague and Weis-Garcia, 
Cancer Surveys 27:41-64, 1996). The activin type II receptors (Act RIIA and/or Act 

10 RIIB), for example, are known to bind members of the TGF-B superfamily. The 
phenotype of mice lacking the Act RIIB receptor showed anterior/posterior axial 
patterning defects and kidney abnormalities that were very similar to those observed 
in GDF-1 1 knock-out mice (McPherron et al., Nat. Genet. 22:260-264, 1999; Oh and 
Li, Genes Devel. 11:1812-1826, 1997). Since the amino acid sequences of GDF-1 1 

1 5 and myostatin (GDF-8) are 90% identical in the mature C-terminal region, the ability 
of myostatin to specifically interact with a activin type II receptor was examined 

Myostatin was labeled by radio-iodination, and binding studies were 
performed using COS cells transfected with an Act RIIB expression construct. 

20 Myostatin interacted specifically with the transfected COS cells. Myostatin binding 
was competed in a dose dependent manner by excess unlabeled myostatin, and was 
significantly lower in control COS cells, which were transfected with an empty 
vector. No significant binding occurred to cells transfected with a BMP RII or 
TGF-B RII expression construct. Myostatin binding to Act RIIB-transfected cells was 

25 saturable, and the binding affinity was approximately 5 nM as determined by 
Scatchard analysis. 

The receptor binding assay also was used to examine the ability of the 
myostatin prodomain to inhibit the ability of the mature C-terminal dimer to interact 
30 specifically with Act RIIB in this system. The addition of purified prodomain peptide 
blocked the ability of the C-terminal dimer to bind the Act RIIB transfected COS cells 
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in a dose-dependent manner. These results indicate that the myostatin prodomain is a 
natural inhibitor of myostatin. 

EXAMPLE 8 

5 INCREASED MYOSTATIN LEVELS INDUCE WEIGHT LOSS 

This example demonstrates that elevated levels of myostatin can lead to 
substantial weight loss in vivo. 

In one set of experiments, CHO cells that express myostatin were injected into 
1 0 nude mice. The nude mice that had myostatin expressing CHO cell tumors showed 
severe wasting over the course of approximately 12 to 16 days following injection of 
the cells. This wasting syndrome was not observed in nude mice injected with any of 
a variety of control CHO lines that had undergone a similar selection process but did 
not express myostatin. Furthermore, the myostatin coding sequence in the construct 
1 5 used to transfect the CHO cells was under the control of a metallothionein promoter, 
and the .wasting syndrome was exacerbated when mice bearing the myostatin 
expressing tumors were maintained on water containing zinc sulfate. Western blot 
analysis revealed high levels of myostatin protein in the serum of the nude mice that 
bore the myostatin-expressing CHO cells. These results indicate that the wasting 
20 syndrome was induced in response to the elevated level of myostatin in the nude mice 
and, as discussed below, this result was confirmed by observing similar effects in 
mice injected with purified myostatin. 

The dramatic weight loss observed in the nude mice bearing myostatin 
25 expressing CHO cells was due primarily to a disproportionate loss of both fat and 
muscle weight. White fat pad weights (intrascapular white, uterine, and 
retroperitoneal fat) were reduced by greater than 90% compared to mice bearing 
control CHO cell tumors. Muscle weights were also severely reduced, with individual 
muscles weighing approximately half as much in myostatin expressing mice as in 
30 control mice by day 16. This loss in muscle weight was reflected by a corresponding 
decrease in fiber sizes and protein content. 
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Mice bearing the myostatin expressing CHO cell tumors also became severely 
hypoglycemic. However, the weight loss and hypoglycemia were not due to a 
difference in food consumption, as all of the mice consumed equivalent amounts of 
food at each time interval examined during the 16 day course of the study. These 
5 results indicate that myostatin overexpression induces a dramatic weight loss, which 
resembles the cachectic wasting syndrome that occurs in patients suffering from 
chronic diseases such as cancer or AIDS. 

With more chronic administrations using lower doses of myostatin, changes in 
10 fat weight were observed. For example; twice daily injections of 1 p.g of myostatin 
protein for 7 days resulted in an approximately 50% decrease in the weights of a 
number of different white fat pads (intrascapular white, uterine, and retroperitoneal fat 
pads) with no significant effect on brown fat (intrascapular brown). These results 
confirm that myostatin can induce weight loss and, in extreme cases, a wasting 
15 syndrome in vivo. 

EXAMPLE 9 

CHARACTERIZATION OF MYOSTATIN BINDING 
TO AN ACTIVIN RECEPTOR 
20 This example describes a method for characterizing the relationship of 

myostatin binding to an activin receptor with the biological effects produced by 
myostatin in vivo. 

Act RIIA or Act RIIB knock-out mice can be used to confirm that Act RIIA or 
25 Act RIIB is a receptor for myostatin in vivo. A detailed muscle analysis of these mice 
can determine whether knock-out of an activin receptors is associated with a change 
in muscle fiber number or size. Since Act RII A/Act RIIB double homozygous mutant 
die early during embryogenesis (Song et al, Devel. Biol. 213:157-169, 1999), only 
the various homozygous/heterozygous combinations can be examined. However, 
30 tissue-specific or conditional knock-out mice can be generated such that both genes 
can be "deleted" only in muscle tissue, thus allowing postnatal examination of the 
double homozygous knock-out mice. 
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The effect on adipose tissue can be examined with aging of the mice to 
determine whether the number of adipocytes or the accumulation of lipid by these 
adipocytes is altered in the knock-out mice. Adipocyte number and size is determined 
5 by preparing cell suspensions from collagenase treated tissue (Rodbell, J. Biol. Chem. 
239:375-380, 1964; Hirsch and Gallian, J. Lipid Res. 9:1 10-1 19, 1968). Total lipid 
content in the animals is determined by measuring dry carcass weights and then the 
residual dry carcass weights after lipid extraction (Folch et al., J. Biol. Chem. 
226:497-509, 1957). 

10 

A variety of serum parameters also can be examined, including fed and fasting 
glucose and insulin, triglycerides, cholesterol, and leptin. As disclosed above, serum 
triglycerides and serum insulin are decreased in the myostatin mutant animals. The 
ability of the activin receptor knock-out mice to respond to an exogenous glucose load 
15 also can be examined using glucose tolerance tests. As disclosed above, the response 
to a glucose load essentially was identical in wild type and myostatin mutant mice at 
5 months of age. This observation can be extended by measuring these parameters in 
the mice as they age. Serum insulin levels also can be measured at various times 
during the glucose tolerance tests. 

20 

Basal metabolic rates also can be monitored using a calorimeter (Columbus 
Instruments). As disclosed above, myostatin mutant mice have a lower metabolic rate 
at 3 months of age than their wild type counterparts. This analysis can be extended to 
older mice, and the respiratory quotient also can be measured in these animals. The 
25 ability to maintain normal thermogenesis can be determined by measuring the basal 
body temperatures as well as their ability to maintain body temperature when placed 
at 4°C. Brown fat weights and expression levels of UCP1, UCP2, and UCP3 in 
brown fat, white fat, muscle, and other tissues also can be examined (Schrauwen et 
al., 1999). 

30 

Food intake relative to weight gain can be monitored, and feed efficiency can 
be calculated. In addition, the weight gain of the animals placed on high fat diets can 
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be monitored. Wild type mice maintained on a high fat diet accumulate fat rapidly, 
whereas the results disclosed herein indicate the activin receptor mutant animals will 
remain relatively lean. 

5 The results of these studies can provide a more complete profile of the effect 

of myostatin mice, particularly with respect to their overall metabolic state, thereby 
providing insight as to whether the ability of the myostatin knock-out mice to 
suppress the accumulation of fat is an anabolic effect of the myostatin mutation in 
muscle that leads to a shift in energy utilization such that little energy is available for 
10 storage in the form of fat. For example, the decreased fat accumulation can be due to 
an increased rate of thermogenesis. These results also will provide a baseline for 
comparison of the effect of the myostatin activity in the context of different genetic 
models of obesity and type II diabetes. 

15 EXAMPLE 10 

CHARACTERIZATION OF MYOSTATIN EFFECTS 
IN GENETIC MODELS OF OBESITY AND TYPE II DIABETES 
This example describes methods for determining the effect of myostatin in 
treating obesity or type II diabetes. 

20 

The dramatic reduction in overall fat accumulation in myostatin mutant mice 
as compared to wild type mice indicates that myostatin activity can be manipulated to 
treat or prevent obesity or type II diabetes. The effect of the myostatin mutation can 
be examined in the context of several well characterized mouse models of these 

25 metabolic diseases, including, for example, "obese" mice (ob/ob), "diabetic" mice 
(db/db), and agouti lethal yellow (A y ) mutant strains. Each of these strains is 
abnormal for virtually every parameter and test described above (see, for example, 
Yen et al, supra, 1994, Friedman and Halaas, Nature 395:763-770, 1998). The 
ability of the myostatin mutation to slow or suppress the development of these 

30 abnormalities in mice carrying these other mutations can be examined by constructing 
double mutants, then subjecting the double mutant animals, along with appropriate 
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control littermates carrying only the ob/ob, db/db, or agouti lethal yellow mutations, 
to the various tests. 

As disclosed above, the myostatin mutation in A y mice was associated with 
5 about a 5 -fold suppression of fat accumulation in the myostatin mutant A y mice, and 
with a partial suppression of the development of abnormal glucose metabolism as 
assessed by glucose tolerance tests. These results can be extended to include 
additional animals at various ages, and similar studies can be performed with the 
ob/ob and db/db mutants. As both the these mutations are recessive, mice that are 

10 doubly homozygous for the myostatin mutation and either the ob or db mutation can 
be generated. In order to examine the effects of partial loss of myostatin function in 
these genetic model systems, mice that are homozygous for the ob or db mutation and 
heterozygous for the myostatin mutation also are examined. Mice that are doubly 
heterozygous for the myostatin and ob mutations have been generated, and the 

1 5 offspring from matings of these doubly heterozygous mice can be examined, 
particularly with respect to fat accumulation and glucose metabolism. Partial 
suppression of either or both of these abnormalities in the obese mutants can indicate 
that myostatin is a target for the treatment of obesity and type II diabetes. 

20 EXAMPLE 11 

CHARACTERIZATION OF TRANSGENIC MICE EXPRESSING DOMINANT 
NEGATIVE POLYPEPTIDES THAT CAN AFFECT MYOSTATIN ACTIVITY 
This example describes methods for characterizing the effect of myostatin 
postnatally by expressing dominant negative polypeptides that can block myostatin 
25 expression or myostatin signal transduction. 

Myostatin Inhibitors 

The modulation of myostatin activity postnatally can be used to determine the 
effect of myostatin on muscle fiber number (hyperplasia)and muscle fiber size 
30 (hypertrophy). Conditional myostatin knock-out mice, in which the myostatin gene is 
deleted at defined times during the life of the animal, can be used for these studies. 
The tet regulator in combination with the ere recombinase provides a system for 
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generating such mice. In this system, the expression of ere is induced by 
administration of doxycycline. 

Transgenic mice expressing an inhibitor of myostatin from an inducible 
5 promoter also can be generated such that myostatin activity can be reduced or 

inhibited at defined times during the life of the animal. The tetracycline regulators are 
useful for generating such transgenic mice, in which the myostatin expression is 
induced by doxycycline. 

1 0 A modification of the tet system, which utilizes co-expression of a hybrid 

reverse tet-transactivator (fusion protein of the activation domain of VP 16 with the 
mutant reverse tet repressor) and a hybrid tet-transrepressor (fusion protein of the 
KRAB repressor domain of mammalian Koxl with the native tet repressor), can be 
particularly useful for producing the transgenic mice (Rossi et al., Nat. Genet. 20:389- 

15 393, 1998; Forster et al., Nuc.1. Acids Res. 27:708-710, 1999). In this system, the 
hybrid reverse tet-transactivator binds tet operator sequences, and activates 
transcription only in the presence of tetracycline; the hybrid tet-transrepressor binds 
tet operator sequences and represses transcription only in the absence of tetracycline. 
By co-expressing these two fusion proteins, the basal activity of the target promoter is 

20 silenced by the tet-transrepressor in the absence of tetracycline, and is activated by the 
reverse tet-transactivator upon administration of tetracycline. 

Two types of transgenic lines can be generated. In the first type, the transgene 
encodes a myostatin inhibitor polypeptide under the control of a muscle specific 

25 promoter, for example, the muscle creatine kinase promoter (Sternberg et al., supra, 
1988) or the myosin light chain enhancer/promoter (Donoghue et al., supra, 1991). 
Individual transgenic lines are screened for specific expression of the tet regulators in 
skeletal muscle, and several independent lines for each of the two promoters are 
selected and examined to confirm that any effects observed are not due, for example, 

30 to integration site-specific effects. A construct containing the two tet regulators under 
the control of the myosin light chain promoter/enhancer has been constructed, and can 
be used for pronuclear injections. In the second type of line, the transgene contains a 
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myostatin inhibitor polypeptide under the control of a minimal CMV promoter that 
further contains tet operator sequences. 

The myostatin inhibitor can be a dominant negative form of myostatin or a 
myostatin prodomain, which, as disclosed herein, can inhibit myostatin activity. 
Dominant negative forms of TGF-B family members have been described (see, for 
example, Lopez et al., Mol. Cell Biol. 12:1674-1679, 1992; Wittbrodt and Rosa, 
Genes Devel. 8:1448-1462, 1994), and contain, for example, a mutant proteolytic 
cleavage site, thereby preventing the protein from being processed into the 
biologically active species. When co-expressed in a cell with the endogenous wild 
type gene, the mutant protein forms non-functional heterodimers with the wild type 
protein, thus acting as a dominant negative. A mutant myostatin polypeptide 
containing a mutation in the promyostatin cleavage site has been constructed, and can 
be examined for a dominant negative effect by co-expressing the mutant with wild 
type myostatin in varying ratios in 293 cells. Conditioned medium from 293 cells 
transiently transfected with the constructs can be examined by western blot analysis 
and the ability of the mutant to block the formation of mature C-terminal dimers can 
be examined. 

An expression construct encoding only the myostatin prodomain also can be 
utilized. As disclosed above, the prodomain forms a tight complex with the mature 
C-termmal dimer and blocks the ability of the mature C-terminal myostatin dimer to 
bind Act RIIB in cells expressing the receptor in culture. By analogy to TGF-B, the 
myostatin prodomain also can maintain the mature C-terminal dimer in an inactive 
latent complex in vivo. 

These transgenic animals can be bred with those expressing the tet regulators 
to generate doubly transgenic lines containing both the tet regulators and the inhibitor 
target construct. These doubly transgenic lines can be screened for those in which all 
of the different components are expressed appropriately. Northern blot analysis using 
RNA obtained from various muscles and control tissues from representative mice in 
each line, before and after administration of doxy cycline in the drinking water, can be 

Gray Cary\GT\6209497.1 
104659-160348 



113 



used to identify such transgenic lines. Transgenic lines will be selected that do not 
express the transgene in any tissue in the absence of doxycycline, and that express the 
transgene only in muscle in the presence of doxycycline. 

5 Doxycycline is administered to the selected transgenic animals and the effect 

on muscle mass is examined. Doxycycline can be administered to pregnant mothers 
to induce the expression of the inhibitor during embryogenesis. The effect of 
blocking myostatin activity during development of the transgenic animals can be 
compared to the effects observed in myostatin knock-out mice. Since the promoters 
1 0 for driving expression of the tet regulators can be induced at a later time during 
development than the time when myostatin is first expressed, the effect on muscle 
mass in the transgenic mice can be compared to the effect that occurs in the myostatin 
knock-out mice. 

1 5 The effect of inhibiting myostatin activity postnatally can be examined by 

administering doxycycline to the doubly transgenic mice at various times after birth. 
Doxycycline treatment can begin, for example, at 3 weeks of age, and the animals can 
be analyzed at 5 months of age, which is the age at which the difference in muscle 
weights were at a maximum in the myostatin knock-out mice versus wild type mice. 

20 The animals are examined for the effects of the inhibitor on muscle mass. Muscles 
also can be examined histologically to determine effects on fiber number and fiber 
size. In addition, a fiber type analysis of various muscles in the transgenic mice can 
be performed to determine whether there is a selective effect on type I or type II 
fibers. 

25 

Doxycycline can be administered in different doses and at different times to 
characterize the effect of the myostatin inhibitors. Doubly transgenic mice also can 
be maintained chronically on doxycycline, then examined for effects on fat pad 
weights and other relevant metabolic parameters as described above. The results of 
30 these studies can confirm that modulating myostatin activity postnatally can increase 
muscle mass or decrease fat accumulation, thus indicating that targeting myostatin can 
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be useful for the treatment of a variety of muscle wasting and metabolic diseases 
clinically. 

Myostatin 

5 Transgenic mice containing a myostatin transgene also can be examined and 

the effects produced upon expression of myostatin can be compared with those 
observed in the nude mice containing the myostatin-expressing CHO cells. Similarly 
as described above, myostatin can be placed under control of conditional (tet) and 
tissue specific regulatory elements, and expression of myostatin in the transgenic mice 
1 0 can be examined to determine whether a wasting syndrome occurs similar to that 
observed in the nude mice. The myostatin transgene can include, for example, 
processing signals derived from SV40, such that transgene can be distinguished from 
the endogenous myostatin gene. 

1 5 Serum samples can be isolated from the myostatin transgenic mice at various 

times following the administration of doxycycline and the level of myostatin 
transgene product in the serum can be determined. Total body weights of the animals 
are monitored over time to determine whether the animals show significant weight 
loss. In addition, individual muscles and fat pads are isolated and weighed, and the 

20 number, size and type of muscle fibers are determined in selected muscle samples. 

The level of myostatin transgene expression can be varied by varying the dose 
of doxycycline administered to the animals. Transgene expression can be monitored 
using, for example, northern blot analysis of transgene RNA levels in muscle, or 

25 myostatin protein levels in serum. The identification of specific levels of myostatin 
transgene expression allows a correlation of the extent of wasting induced by 
myostatin. The transgenic lines also can be crossed with the myostatin knock-out 
mice to generate mice in which the only source of myostatin is expression from the 
transgene. Expression of myostatin at various times during development can be 

30 examined and the effect of myostatin on fiber number, fiber size, and fiber type can 
be determined. The availability of mice in which the expression of myostatin can be 
precisely and rapidly controlled provides a powerful tool for further characterizing the 
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myostatin signal transduction pathway and for examining the effects of various agents 
that potentially can be useful for modulating myostatin signal transduction. 

Effectors of myostatin signal transduction 
5 Transgenic mice containing either dominant negative forms of a myostatin 

signal transduction pathway, which can include components of a TGF-p signal 
transduction pathway, that is expressed specifically in skeletal muscle can be 
generated. As disclosed herein, the Smad proteins, which mediate signal transduction 
through a pathway induced by activin type II receptors, can be involved in myostatin 
10 signal transduction. 

Act RIIB can bind GDF-11, which is highly related to myostatin (McPherron 
et al., supra, 1997; Gamer et al., supra, 1999; Nakashima et al., Mech. Devel. 80:185- 
189, 1999), and expression of c-ski, which can bind an inhibit Smad 2, Smad 3, and 

1 5 Smad 4, dramatically affects muscle growth (Sutrave et al., supra, 1 990; Berk et al., 
supra, 1997; see, also, Luo et al., supra, 1999; Stroschein et al., supra, 1999; Sun et 
al., supra, 1999a and b; Akiyoshi et al, supra, 1999). As disclosed herein, myostatin 
interacts specifically with Act RIIB and, therefore, can exert its biological effect, at 
least in part, by binding to activin type II receptors in vivo and activating the Smad 

20 signaling pathway. 

The role of the Smad signaling pathway in regulating muscle growth can be 
examined using transgenic mouse lines that are blocked, or capable of being blocked, 
at specific points in the Act RIIB/Smad signal transduction pathway. The muscle 
25 creatine kinase promoter or myosin light chain enhancer/promoter can be used to 
drive expression of various inhibitors of the Smad signal transduction pathway. 

An inhibitor useful in this system can include, for example, follistatin; a 
dominant negative Act RIIB receptor; a dominant negative Smad polypeptide such as 
30 Smad 3; c-ski; or an inhibitory Smad polypeptide such as Smad 7. Follistatin can 
bind and inhibit the activity of certain TGF-B family members, including GDF-11 
(Gamer et al., supra, 1999). Dominant negative forms of an activin type II receptor 
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can be obtained, for example, by expressing the extracellular domain, particularly a 
soluble form of an Act RIIB extracellular domain, or by expressing a truncated 
Act RIIB receptor that lacks the kinase domain or contains a mutation such that the 
mutant receptor lacks kinase activity. Smad 7 functions as an inhibitory Smad that 
5 can block the signaling pathway induced by activin, TGF-B, and BMP. A dominant 
negative form of Smad 3, for example, can be constructed by mutating the Smad 3 
C-terminal phosphorylation sites, thereby blocking Smad 3 function (Liu et al., supra, 
1997). c-ski overexpression has been correlated to muscle hypertrophy in transgenic 
mice (Sutrave et al., supra, 1990). 

10 

Transgenic mice can be prepared and each founder line examined for proper, 
muscle-specific expression of the transgene. The selected mice are examined for total 
body weights, individual muscle weights, and muscle fiber sizes, numbers and types. 
Those lines demonstrating a clear effect on muscle mass can be examined further for 

1 5 fat accumulation and other relevant metabolic parameters as described above. The 

use of these different agents to target specific steps in the activin receptor/Smad signal 
transduction pathway is particularly informative because the signaling pathways for 
the different agents overlap at different steps. For example, follistatin binds to and 
inhibits activin and GDF-1 1 activity, but not TGF-B, whereas a dominant negative 

20 Smad 3 can block signaling through both activin and TGF-B receptors. Smad 7 can 
have an even more pleotropic because it blocks signaling through BMP receptors as 
well. The studies can allow the identification of specific targets for modulating 
myostatin activity, thus providing various strategies for developing drugs or other 
agents that modulate myostatin signal transduction and, therefore, myostatin activity. 

25 

In particular, the transgenic lines described herein can be used to determine 
the effect of blocking myostatin function or the Smad signaling pathway postnatally 
on the development of obesity or type II diabetes. For example, the inhibitory 
transgenes can e crossed into the ob/ob, db/db, and A y mutant mice. In the absence of 
30 doxycycline, an inhibitor transgene is not expressed and, therefore, the animals are 
indistinguishable from each of the parental mutant mice. In the presence of 
doxycycline, the inhibitor is expressed and can block myostatin activity. The effect of 
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blocking myostatin activity on development of the metabolic abnormalities in these 
mutant animals can be examined. 

Expression of the inhibitor can be induced at an early age, for example, at 
5 3 weeks of age, to maximize the effect. In addition, myostatin activity can be blocked 
prior to the time that the metabolic abnormalities become so severe as to be 
irreversible. Animals can be maintained on doxycycline and assessed at various ages 
using the tests described above, including those relating to fat accumulation and 
glucose metabolism. Any delay in the age at which one or more test results becomes 

1 0 abnormal in the ob/ob, db/db, and A y mutant animals can be identified. Similar 
studies can be performed using older animals, which have developed some of the 
signs of obesity or type II diabetes, and the effect of blocking myostatin activity on 
various parameters, including fat weight and glucose metabolism, can be determined. 
The results of these studies can further identify specific targets that can be 

1 5 manipulated in an effort to prevent or treat obesity or type II diabetes. 

EXAMPLE 12 

CHARACTF.RTZATTQN OF MYOSTATIN EFFECT 
ON THE IND UCTION OF CACHEXIA 
20 This example describes methods for determining the role of myostatin signal 

transduction in the development and progression of cachexia. 

The activin receptor and Smad pathway can constitute at least part of the 
signal transduction pathway involved in mediating myostatin activity in normal 

25 individuals and, therefore, can be involved in mediating the effects that occur in an 
individual due to excess levels of myostatin. As disclosed herein, cachexia, for 
example, can be mediated, at least in part, by abnormally high levels of myostatin. As 
such, methods for manipulating signal transduction through the Smad pathway can 
provide a new strategy for developing drugs for the treatment of muscle wasting in 

30 general and cachexia in particular. 
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The role of the Smad signaling pathway in cachexia can be examined by 
examining the susceptibility of the various transgenic lines described above to 
cachexia, which can be induced, for example, by interleukin-6 (IL-6; Black et al, 
Endocrinology 128:2657-2659, 1991, which is incorporated herein by reference]), 
5 tumor necrosis factor-cc (TNF-a; Oliff et al., Cell 50:555-563, 1987, which is 
incorporated herein by reference), or certain tumor cells. In the case of IL-6 and 
TNF-a, the inhibitor transgenes can be crossed into a nude mouse background, then 
the animals can be challenged with CHO cells that produce IL-6 or TNF-a, which 
induce wasting in nude mice when overexpressed in this manner. CHO cells that 
1 0 overproduce IL-6 or TNF-a can be prepared using the methods described above for 
generating myostatin overproducing cells. For example, TNF-a cDNA can be cloned 
into the-pMSXND expression vector (Lee and Nathans, J. Biol. Chem. 263:3521- 
3527, 1988), then cells carrying amplified copies of the expression construct can be 
selected stepwise in increasing concentrations of methotrexate. 

15 

Tumor cells such as Lewis lung carcinoma cells (Matthys et al., Eur. J. Cancer 
27:182-187, 1991, which is incorporated herein by reference) or colon 26 
adenocarcinoma cells (Tanaka et al., J. Cancer Res. 50:2290-2295, 1990, which is 
incorporated herein by reference), which can induce cachexia in mice, also can be 

20 utilized for these studies. These cell lines cause severe wasting when grown as 
tumors in mice. Thus, the effect of these tumors can be examined in the various 
transgenic mice described herein. It is recognized that the various tumor cells will 
only grow in certain genetic backgrounds. For example, the Lewis lung carcinoma 
cells are routinely grown in C57 BL/6 mice, and the colon 26 carcinoma cells are 

25 routinely grown in BALB/c mice. Thus, the transgenes can be backcrossed into these 
or other genetic backgrounds to allow growth of the tumor cells. 

Various parameters, including total body weight, individual muscle weight, 
muscle fiber size and number, food intake and serum parameters, including glucose 
30 levels, can be monitored. In addition, serum myostatin levels and myostatin RNA 
levels in muscle can be examined to confirm that increased myostatin expression is 
correlated with cachexia. The results of these studies can confirm that the action of 
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myostatin is downstream of the cachexia-inducing agents in these experimental 
models. The results also can confirm that the Smad signaling pathway is essential for 
development of cachexia in these models, and can demonstrate that a therapeutic 
benefit can be obtained in the treatment of cachexia by modulating the Smad 
5 signaling pathway. 

EXAMPLE 13 

IDENTIFICATION AND CHARACTERIZATION OF GROWTH 
DIFFERENTIATION FACTOR-8 fGDF-8^ and GDF-1 1 RECEPTORS 
10 This example describes methods for identifying and characterizing cell surface 

receptors for GDF-8 (myostatin) and GDF-1 1 . 

The purified GDF-8 and GDF-1 1 proteins will be used primarily to assay for 
biological activities. In order to identify potential target cells for GDF-8 and GDF-1 1 

1 5 action cells expressing their receptors will be searched. For this purpose, the purified 
protein will be radio-iodinated using the chloramine T method, which has been used 
successfully to label other members of this superfamily, like TGF-B (Cheifetz et al., 
supra, 1987), activins (Sugino et al., J. Biol. Chem. 263:15249-15252, 1988), and 
BMPs (Paralkar et al., Proc. Natl. Acad. Sci.. USA 88:3397-3401, 1991), for receptor 

20 binding studies. The mature processed forms of GDF-8 and GDF-1 1 each contain 
multiple tyrosine residues. Two different approaches will be taken to identify 
receptors for these proteins. 

One approach will determine the number, affinity, and distribution of 
25 receptors. Either whole cells grown in culture, frozen sections of embryos or adult 
tissues, -or total membrane fractions prepared from tissues or cultured cells will be 
incubated with the labeled protein, and the amount or distribution of bound protein 
will be determined. For experiments involving cell lines or membranes, the amount 
of binding will be determined by measuring either the amount of radioactivity bound 
30 to cells on the dish after several washes or, in the case of membranes, the amount of 
radioactivity sedimented with the membranes after centrifugation or retained with the 
membranes on a filter. For experiments involving primary cultures, where the 
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number of cells can be more limited, binding sites will be visualized directly by 
overlaying with photographic emulsion. For experiments involving frozen sections, 
sites of ligand binding will be visualized by exposing these sections to high resolution 
Beta-max hyperfilm; if finer localization is required, the sections will be dipped in 
5 photographic emulsion. For all of these experiments, specific binding will be 

determined by adding excess unlabeled protein as competitor (for example, see Lee 
and Nathans, supra, 1988). 

A second approach will be to characterize the receptor biochemically. 

1 0 Membrane preparations or potential target cells grown in culture will be incubated 
with labeled ligand, and receptor/ligand complexes will be covalently cross-linked 
using disuccinimidyl suberate, which has been commonly used to identify receptors 
for a variety of ligands, including members of the TGF-B superfamily (Massague and 
Like, J. Biol. Chem. 260:2636-2645, 1985). Cross-linked complexes are separated by 

1 5 electrophoresis on SDS polyacrylamide gels to look for bands labeled in the absence, 
but not in the presence, of excess unlabeled protein. The molecular weight of the 
putative receptor will be estimated by subtracting the molecular weight of the ligand. 
An important question that these experiments will address is whether GDF-8 and 
GDF-1 1 signal through type I and type II receptors like many other members of the 

20 TGF-B superfamily (Massague and Weis-Garcia, supra, 1996). 

Once a method for detecting receptors for these molecules has been achieved, 
more detailed analysis will be carried out to determine the binding affinities and 
specificities. A Scatchard analysis will be used to determine the number of binding 

25 sites and dissociation constants. By carrying out cross-competition analyses between 
GDF-8 and GDF-1 1, it will be possible to determine whether they are capable of 
binding to the same receptor and their relative affinities. These studies will give an 
indication as to whether the molecules signal through the same or different receptors. 
Competition experiments using other TGF-B family members will be performed to 

30 determine specificity. Some of these ligands are available commercially, and some 
others are available from Genetics Institute, Inc. 
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For these experiments, a variety of embryonic and adult tissues and cell lines 
will be tested. Based on the specific expression of GDF-8 in skeletal muscle and the 
phenotype of GDF-8 knock-out mice, initial studies focus on embryonic and adult 
muscle tissue for membrane preparation and for receptor studies using frozen 
5 sections. In addition, myoblasts will be isolated and cultured from embryos at various 
days of gestation or satellite cells from adult muscle as described (Vivarelli and 
Cossu. Devel. Biol. 117:319-325. 1986; Cossu et al., CellDiff 9:357-368, 1980). 
The binding studies on these primary cells after various days in culture will be 
performed and binding sites localized by autoradiography so that the binding sites can 

10 be c-o-localized with various myogenic markers, such as muscle myosin (Vivarelli et 
al., J. Cell Biol. 107:2191-2197, 1988), and correlate binding with the differentiation 
state of the cells, such as formation of multinucleated myotubes. In addition to using 
primary cells, cell lines will be utilized to look for receptors. In particular, the initial 
focus will be on three cells lines, C2C12, L6, and PI 9. C2C12 and L6 myoblasts 

1 5 differentiate spontaneously in culture and form myotubes depending on the particular 
growth conditions (Yaffe and Saxel, supra, 1977; Yaffe, supra, 1968). P19 
embryonal carcinoma cells can be induced to differentiate into various cell types, 
including skeletal muscle cells in the presence of DMSO (Rudnicki and McBurney, 
Teratocarcinomas and Embryonic Stem Cells: A practical approach (E.J. Robertson, 

20 IRL Press, Cambridge 1987). Receptor binding studies will be carried out on these 
cell lines under various growth conditions and at various stages of differentiation. 
Although the initial studies will focus on muscle cells, other tissues and cell types will 
be examined for the presence of GDF-8 and GDF-1 1 receptors. 

25 Recombinant human GDF-8 (rhGDF-8) homodimer will be used in these 

binding studies. RhGDF-8 was expressed using CHO cells and purified to 
approximately 90% purity. The rhGDF-8 had the expected 25 kDa to 27 kDa 
molecular weight and, upon reduction, was reduced to the 12 kDa monomer. Using 
1-125 labeled GDF-8 in a receptor-ligand binding assay, two myoblast cell lines, L6 

30 and G-8, bound GDF-8. The binding was specific since non labeled GDF-8 

effectively competed the binding of the labeled ligand. The dissociation constant 
(Kd) was 370 pM, and L6 myoblasts have a high number (5,000 receptors/cell) of cell 
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surface binding proteins. GDF-1 1 (BMP-1 1) is highly homologous (>90%) to 
GDF-8. Receptor binding studies revealed that GDF-8 and GDF-1 1 bound to the 
same binding proteins on L6 myoblasts. It is important to establish whether or not 
GDF-8 binds to the known TGF-p receptor. TGF-p did not compete the binding of 
GDF-8, indicating that the GDF-8 receptor is distinct from the TGF-P receptor. The 
GDF-8 receptor was not expressed on all myoblast cell lines, including four myoblast 
cell lines, C2C12, G7, MLB13MYC cl4 and BC3H1, which do not bind GDF-8. 

The gene or genes encoding receptors for GDF-8 and GDF-1 1 can be 
obtained. As a first step towards understanding the mechanism by which GDF-8 and 
GDF-1 1 exert their biological effects, it is important to clone the genes encoding their 
receptors. From the experiments above, it will be more clear as to whether GDF-8 
and GDF-1 1 bind to the same receptor or to different receptors. There will also be 
considerable information regarding the tissue and cell type distribution of these 
receptors. Using this information, two different approaches will be taken to clone the 
receptor genes. 

The first approach will be to use an expression cloning strategy. In fact, this 
was the strategy that was originally used by Mathews and Vale (Cell 65:973-982, 
1991) and Lin et al. (Cell 68:775-785, 1992) to clone the first activin and TGF-B 
receptors. Poly A-selected RNA from the tissue or cell type that expresses the highest 
relative number of high affinity binding sites will be obtained, and used to prepare a 
cDNA library in the mammalian expression vector pcDNA-1 , which contains a CMV 
promoter and an SV40 origin of replication. The library will be plated, and cells from 
each plate will be pooled into broth and frozen. Aliquots from each pool will be 
grown for preparation of DNA. Each individual pool will be transiently transfected 
into COS cells in chamber slides, and transfected cells will be incubated with 
iodinated GDF-8 or GDF-1 1. After washing away the unbound protein, the sites of 
ligand binding will be visualized by autoradiography. Once a positive pool is 
identified, the cells from that pool will be replated at lower density, and the process 
will be repeated. Positive pools will then be plated, and individual colonies will be 
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picked into grids and re-analyzed as described (Wong et al., Science 228:810-815, 
1985). 

Initially, using pool sizes of 1 500 colonies will be screened. In order to be 
5 certain to identify a positive clone in a mixture of this complexity, a control 

experiment using TGF-B and a cloned type II receptor will be performed. The coding 
sequence for the TGF-B type II receptor will be cloned into the pcDNA-1 vector, and 
bacteria transformed with this construct will be mixed with bacteria from our library 
at various ratios, including 1 : 1 500. The DNA prepared from this mixture then will be 
1 0 transfected into COS cells, incubated with iodinated TGF-B, and visualized by 

autoradiography. If positive signals are observed at a ratio of 1 : 1 500, pools of 1 500 
clones will be screened. Otherwise, smaller pool sizes corresponding to ratios at 
which the procedure is sensitive enough to identify a positive signal in control 
experiments will be used. 

15 

A second parallel strategy to attempt to clone the GDF-8 and GDF-1 1 
receptors also will be used, taking advantage of the fact that most receptors for 
members of the TGF-B superfamily that have been identified belong to the membrane- 
spanning serine/threonine kinase family (Massague and Weis-Garcia, supra, 1996). 

20 Because the cytoplasmic domains of these receptors are related in sequence, 

degenerate PCR probes will be used to clone members of this receptor family that are 
expressed in tissues that contain binding sites for GDF-8 and GDF-1 1 . In fact, this is 
the approach that has been used to identify most of the members of this receptor 
family. The general strategy will be to design degenerate primers corresponding to 

25 conserved regions of the known receptors, to use these primers for PCR on cDNA 
prepared from the appropriate RNA samples (most likely from skeletal muscle), to 
subclone the PCR products, and finally to sequence individual subclones. As 
sequences are identified, they will be used as hybridization probes to eliminate 
duplicate clones from further analysis. The receptors that are identified then will be 

30 tested for their ability to bind purified GDF-8 and GDF-1 1 . Because this screen will 
yield only small PCR products, full-length cDNA clones will be obtained for each 
receptor from cDNA libraries prepared from the appropriate tissue, inserted into the 
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pcDNA-1 vector, transfected into COS cells, and the transfected cells will be assayed 
for their ability to bind iodinated GDF-8 or GDF-1 1 . Ideally, every receptor that is 
identified in this screen will be tested for the ability to bind these ligands. However, 
the number of receptors that are identified can be large, and isolating all of the full- 
5 length cDNAs and testing them can require considerable effort. Almost certainly 
some of the receptors that are identified will correspond to known receptors, and for 
these, either obtaining full-length cDNA clones from other investigators or amplifying 
the coding sequences by PCR based on the published sequences should be 
straightforward. For novel sequences, the tissue distribution will be determined by 
1 0 northern blot analysis and the highest priority will be directed to those receptors 
whose expression pattern most closely resembles the distribution of GDF-8 and/or 
GDF-1 1 binding sites as determined above. 

In particular, it is known that these receptors fall into two classes, type I and 

1 5 type II, which can be distinguished based on the sequence and which are both 

required for full activity. Certain ligands cannot bind type I receptors in the absence 
of type II receptors while others are capable of binding both receptor types (Massague 
and Weis-Garcia, supra, 1996). The cross-linking experiments outlined above should 
give some indication as to whether both type I and type II receptors are also involved 

20 in signaling GDF-8 and GDF-1 1 . If so, it will be important to clone both of these 

receptor subtypes in order to fully understand how GDF-8 and GDF-1 1 transmit their 
signals. Because it cannot be predicted as to whether the type I receptor is capable of 
interacting with GDF-8 and GDF-1 1 in the absence of the type II receptor, type II 
receptor(s) will be cloned first. Only after at least one type II receptor has been 

25 identified for these ligands, will an attempt be made to identify the type I receptors for 
GDF-8 and GDF-1 1. The general strategy will be to cotransfect the type II receptor 
with each of the type I receptors that are identified in the PCR screen, then assay the 
transfected cells by crosslinking. If the type I receptor is part of the receptor complex 
for GDF-8 or GDF-1 1, two cross-linked receptor species should be detected in the 

30 transfected cells, one corresponding to the type I receptor and the other corresponding 
to the type II receptor. 
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The search for GDF-8 and GDF-1 1 receptors is further complicated by the fact 
at least one member of the TGF-B superfamily, namely, GDNF, is capable of 
signaling through a completely different type of receptor complex involving a GPI- 
linked component (GDNFR-alpha) and a receptor tyrosine kinase (c-ret; Trupp et al., 
5 Nature 381:785-789, 1996; Durbec et al., Nature 381:789-793, 1996; Treanor et al., 
Nature 382:80-83, 1996; Jing et al., Cell 85:1 1 13-1 124, 1996). Although GDNF is 
the most distantly-related member of the TGF-B superfamily, it is certainly possible 
that other TGF-B family members can also signal through an analogous receptor 
system. If GDF-8 and GDF-1 1 do signal through a similar receptor complex, the 
1 0 expression screening approach should be able to identify at least the GPI-linked 
component (indeed GDNFR-alpha was identified using an expression screening 
approach) of this complex. In the case of GDNF, the similar phenotypes of GDNF- 
and c-ret-deficient mice suggested c-ret as a potential receptor for GDNF. 

15 EXAMPLE 14 

PREPARATION AND CHARACTERIZATION 
OF GDF-1 1 KNOCK-OUT MICE 
The phenotype of GDF-1 1 knock-out mice in several respects resembles the 
phenotype of mice carrying a deletion of a receptor for some members of the 
20 TGF-psuperfamily, including the activin type IIB receptor (Act RIIB). To determine 
the biological function of GDF-1 1, the GDF-1 1 gene was disrupted by homologous 
targeting in embryonic stem cells. 

A murine 129 SvJ genomic library was prepared in lambda FIXII according to 
25 the instructions provided by Stratagene (La Jolla, CA). The structure of the GDF-1 1 
gene was deduced from restriction mapping and partial sequencing of phage clones 
isolated from the library. Vectors for preparing the targeting construct were kindly 
provided by Philip Soriano and Kirk Thomas. To ensure that the resulting mice 
would be null for GDF-1 1 function, the entire mature C-terminal region was deleted 
30 and replaced by a neo cassette. Rl ES cells were transfected with the targeting 

construct, selected with gancyclovir (2 jaM) and G418 (250 jag/ml), and analyzed by 
Southern blot analysis. 
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Homologous targeting of the GDF-1 1 gene was observed in 8/1 55 
gancyclovir/G418 doubly resistant ES cell clones. Following injection of several 
targeted clones into C57BL/6J blastocysts, chimeras were obtained from one ES clone 
5 that produced heterozygous pups when crossed to both C57BL/6J and 1 29/S vJ 

females. Crosses of C57BL/6J/129/SvJ hybrid Fl heterozygotes produced 49 wild 
type (34%), 94 heterozygous (66%), and no homozygous mutant adult offspring. 
Similarly, there were no adult homozygous null animals seen in the 129/SvJ 
background (32 wild type (36%) and 56 heterozygous mutant (64%) animals). 

10 

To determine the age at which homozygous mutants were dying, litters of 
embryos isolated at various gestational ages from heterozygous females that had been 
mated to heterozygous males were genotyped. At all embryonic stages examined, 
homozygous mutant embryos were present at approximately the predicted frequency 

15 of 25%. Among hybrid newborn mice, the different genotypes were also represented 
at the expected Mendelian ratio of 1 :2:1 (34 +/+ (28%), 61 +/- (50%), and 28 -/- 
(23%)). Homozygous mutant mice were born alive and were able to breath and nurse. 
All homozygous mutants died, however, within the first 24 hours after birth. The 
precise cause of death was unknown, but the lethality may have been related to the 

20 fact that the kidneys in homozygous mutants were either severely hypoplastic or 
completely absent. 

Homozygous mutant animals were easily recognizable by their severely 
shortened or absent tails. To further characterize the tail defects in these homozygous 

25 mutant animals, their skeletons were examined to determine the degree of disruption 
of the caudal vertebrae. A comparison of wild type and mutant skeleton preparations 
of late stage embryos and newborn mice, however, revealed differences not only in 
the caudal region of the animals but in many other regions as well. In nearly every 
case where differences were noted, the abnormalities appeared to represent homeotic 

30 transformations of vertebral segments in which particular segments appeared to have 
a morphology typical of more anterior segments. These transformations were evident 
throughout the axial skeleton extending from the cervical region to the caudal region. 
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Except for the defects seen in the axial skeleton, the rest of the skeleton, such as the 
cranium and limb bones, appeared normal. 

Anterior transformations of the vertebrae in mutant newborn animals were 
5 most readily apparent in the thoracic region, where there was a dramatic increase in 
the number of thoracic (T) segments. All wild type mice examined showed the 
typical pattern of 13 thoracic vertebrae each with its associated pair of ribs. In 
contrast, homozygous mutant mice showed a striking increase in the number of 
thoracic vertebrae. All homozygous mutants examined had 4 to 5 extra pairs of ribs 
10 for a total of 1 7 to 1 8, although in over 1/3 of these animals, the 1 8th rib appeared to 
be rudimentary. Hence, segments that would normally correspond to lumbar (L) 
segments LI to L4 or L5 appeared to have been transformed into thoracic segments in 
mutant animals. 



1 5 Moreover, transformations within the thoracic region in which one thoracic 

vertebra had a morphology characteristic of another thoracic vertebra were also 
evident. For example, in wild type mice, the first 7 pairs of ribs attach to the sternum, 
and the -remaining 6 are unattached or free. In homozygous mutants, there was an 
increase in the number of both attached and free pairs of ribs to 10-1 1 and 7-8, 

20 respectively. Therefore, thoracic segments T8, T9, T10, and in some cases even Tl 1, 
which all have free ribs in wild type animals, were transformed in mutant animals to 
have a characteristic typical of more anterior thoracic segments, namely, the presence 
of ribs attached to the sternum. Consistent with this finding, the transitional spinous 
process and transitional articular processes which are normally found on T10 in wild 

25 type animals were instead found on Tl 3 in homozygous mutants. Additional 

transformations within the thoracic region were also noted in certain mutant animals. 
For example, in wild type mice, the ribs derived from Tl normally touch the top of 
the sternum. However, in 2/23 hybrid and 2/3 129/SvJ homozygous mutant mice 
examined, T2 appeared to have been transformed to have a morphology resembling 

30 that of Tl ; that is, in these animals, the ribs derived from T2 extended to touch the top 
of the sternum. In these cases, the ribs derived from Tl appeared to fuse to the 
second pair of ribs. Finally, in 82% of homozygous mutants, the long spinous process 
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normally present on T2 was shifted to the position of T3. In certain other 
homozygous mutants, asymmetric fusion of a pair of vertebrosternal ribs was seen at 
other thoracic levels. 

The anterior transformations were not restricted to the thoracic region. The 
anterior, most transformation that we observed was at the level of the 6th cervical 
vertebra (C6). In wild type mice, C6 is readily identifiable by the presence of two 
anterior tuberculi on the ventral side. In several homozygous mutant mice, although 
one of these two anterior tuberculi was present on C6, the other was present at the 
position of C7 instead. Hence, in these mice, C7 appeared to have been partially 
transformed to have a morphology resembling that of C6. One other homozygous 
mutant had 2 anterior tuberculi on C7 but retained one on C6 for a complete C7 to C6 
transformation but a partial C6 to C5 transformation. 

Transformations of the axial skeleton also extended into the lumbar region. 
Whereas wild type animals normally have only 6 lumbar vertebrae, homozygous 
mutants had 8 to 9. At least 6 of the lumbar vertebrae in the mutants must have 
derived from segments that would normally have given rise to sacral and caudal 
vertebrae as the data described above suggest that 4 to 5 lumbar segments were 
transformed into thoracic segments. Hence, homozygous mutant mice had a total of 
33-34 presacral vertebrae compared to 26 presacral vertebrae normally present in wild 
type mice. The most common presacral vertebral patterns were C7/T18/L8 and 
C7/T18/L9 for mutant mice compared to C7/T13/L6 for wild type mice. The 
presence of additional presacral vertebrae in mutant animals was obvious even 
without detailed examination of the skeletons as the position of the hind limbs relative 
to the forelimbs was displaced posteriorly by 7 to 8 segments. 

Although the sacral and caudal vertebrae were also affected in homozygous 
mutant mice, the exact nature of each transformation was not as readily identifiable. 
In wild type mice, sacral segments SI and S2 typically have broad transverse 
processes compared to S3 and S4. In the mutants, there did not appear to be an 
identifiable SI or S2 vertebra. Instead, mutant animals had several vertebrae that 



GrayCary\GT\6209497.1 
104659-160348 



129 



appeared to have morphology similar to S3. In addition, the transverse processes of 
all 4 sacral vertebrae are normally fused to each other although in newborns often 
only fusions of the first 3 vertebrae are seen. In homozygous mutants, however, the 
transverse processes of the sacral vertebrae were usually unfused. In the caudal-most 
region, all mutant animals also had severely malformed vertebrae with extensive 
fusions of cartilage. Although the severity of the fusions made it difficult to count the 
total number of vertebrae in the caudal region, up to 15 transverse processes were 
counted in several animals. It could not be determined whether these represented 
sacral or caudal vertebrae in the mutants because morphologic criteria for 
distinguishing S4 from caudal vertebrae even in wild type newborn animals could not 
be established. Regardless of their identities, the total number of vertebrae in this 
region was significantly reduced from the normal number of approximately 30. 
Hence, although the mutants had significantly more thoracic and lumber vertebrae 
than wild type mice, the total number of segments was reduced in the mutants due to 
the truncation of the tails. 

Heterozygous mice also showed abnormalities in the axial skeleton although 
the phenotype was much milder than in homozygous mice. The most obvious 
abnormality in heterozygous mice was the presence of an additional thoracic segment 
with an-associated pair of ribs. This transformation was present in every 
heterozygous animal examined, and in every case, the additional pair of ribs was 
attached to the sternum. Hence, T8, whose associated rib normally does not touch the 
sternum, appeared to have been transformed to a morphology characteristic of a more 
anterior thoracic vertebra, and LI appeared to have been transformed to a morphology 
characteristic of a posterior thoracic vertebra. Other abnormalities indicative of 
anterior transformations were also seen to varying degrees in heterozygous mice. 
These included a shift of the long spinous process characteristic of T2 by one segment 
to T3, a shift of the articular and spinous processes from T10 to Tl 1, a shift of the 
anterior tuberculus on C6 to C7, and transformation of T2 to Tl where the rib 
associated with T2 touched the top of the sternum. 
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In order to understand the basis for the abnormalities in axial patterning seen 
in GDF-1 1 mutant mice, mutant embryos isolated at various stages of development 
were examined, and compared o wild type embryos. By gross morphological 
examination, homozygous mutant embryos isolated up to day 9.5 of gestation were 
5 not readily distinguishable from corresponding wild type embryos. In particular, the 
number of somites present at any given developmental age was identical between 
mutant and wild type embryos, suggesting that the rate of somite formation was 
unaltered in the mutants. By day 10.5-1 1 .5 p.c, mutant embryos could be easily 
distinguished from wild type embryos by the posterior displacement of the hind limb 
1 0 by 7-8 somites. The abnormalities in tail development were also readily apparent at 
this stage. Taken together, these data suggest that the abnormalities observed in the 
mutant skeletons represented true transformations of segment identities rather than the 
insertion of additional segments, for example, by an enhanced rate of somitogenesis. 

] 5 Alterations in expression of homeobox containing genes are known to cause 

transformations in Drosophila and in vertebrates. To see if the expression patterns of 
Hox genes (the vertebrate homeobox containing genes) were altered in GDF-1 1 null 
mutants, the expression pattern of 3 representative Hox genes, Hoxc-6, Hoxc-8 and 
Hoxc-1 1, was determined in day 12.5 p.c. wild type, heterozygous and homozygous 

20 mutant embryos by whole mount in situ hybridization. The expression pattern of 

Hoxc-6 in wild type embryos spanned prevertebrae 8-15 which correspond to thoracic 
segments T1-T8. In homozygous mutants, however, the Hoxc-6 expression pattern 
was shifted posteriorly and expanded to prevertebrae 9-18 (T2-T1 1). A similar shift 
was seen with the Hoxc-8 probe. In wild type embryos, Hoxc-8 was expressed in 

25 prevertebrae 13-18 (T6-T1 1) but, in homozygous mutant embryos, Hoxc-8 was 
expressed in prevertebrae 14-22 (T7-T15). Finally, Hoxc-1 1 expression was also 
shifted posteriorly in that the anterior boundary of expression changed from 
prevertebrae 28 tin wild type embryos to prevertebrae 36 in mutant embryos. (Note 
that because the position of the hind limb is also shifted posteriorly in mutant 

30 embryos, the Hoxc-1 1 expression patterns in wild type and mutant appeared similar 
relative to the hind limbs). These data provide further evidence that the skeletal 
abnormalities seen in mutant animals represent homeotic transformations. 
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The phenotype of GDF-1 1 mice suggested that GDF-1 1 acts early during 
embryogenesis as a global regulator of axial patterning. To begin to examine the 
mechanism by which GDF-1 1 exerts its effects, the expression pattern of GDF-1 1 in 
early mouse embryos was examined by whole mount in situ hybridization. At these 
stages, the primary sites of GDF-1 1 expression correlated precisely with the known 
sites at which mesodermal cells are generated. Expression of GDF-1 1 was first 
detected at day 8.25-8.5 p.c. (8-10 somites) in the primitive streak region, which is the 
site at which ingressing cells form the mesoderm of the developing embryo. 
Expression was maintained in the primitive streak at day 8.75, but by day 9.5 p.c, 
when the tail bud replaces the primitive streak as the source of new mesodermal cells, 
expression of GDF-1 1 shifted to the tail bud. Hence, at these early stages, GDF-1 1 
appears to be synthesized in the region of the developing embryo where new 
mesodermal cells arise and presumably acquire their positional identity. 

The phenotype of GDF-1 1 knock-out mice in several respects resembled the 
phenotype of mice carrying a deletion of a receptor for some members of the TGF-p 
superfamily, the activin type IIB receptor (Act RUB). As in the case of GDF-1 1 
knock-out mice, the Act RIIB knock-out mice have extra pairs of ribs and a spectrum 
of kidney defects ranging from hypoplastic kidneys to complete absence of kidneys. 
The similarity in the phenotypes of these mice raises the possibility that Act RIIB can 
be a receptor for GDF-1 1 . However, Act RIIB cannot be the sole receptor for 
GDF-1 1 because the phenotype of GDF-1 1 knock-out mice is more severe than the 
phenotype of Act RIIB mice. For example, whereas the GDF-1 1 knock-out animals 
have 4-5 extra pairs of ribs and show homeotic transformations throughout the axial 
skeleton, the Act RIIB knock-out animals have only 3 extra pairs of ribs and do not 
show transformations at other axial levels. In addition, the data indicate that the 
kidney defects in the GDF-1 1 knock-out mice are also more severe than those in 
Act RIIB knock-out mice. The Act RIIB knock-out mice show defects in left/right 
axis formation, such as lung isomerism and a range of heart defects that we have not 
yet observed in GDF-1 1 knock-out mice. Act RIIB can bind the activins and certain 
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BMPs, although none of the knock-out mice generated for these ligands show defects 
in left/right axis formation. 

If GDF-1 1 does act directly on mesodermal cells to establish positional 
identity; the data presented here would be consistent with either short range or 
morphogen models for GDF-1 1 action. That is, GDF-1 1 can act on mesodermal 
precursors to establish patterns of Hox gene expression as these cells are being 
generated at the site of GDF-1 1 expression, or alternatively, GDF-1 1 produced at the 
posterior end of the embryo can diffuse to form a morphogen gradient. Whatever the 
mechanism of action of GDF-1 1 may be, the fact that gross anterior/posterior 
patterning still does occur in GDF-1 1 knock-out animals suggests that GDF-1 1 may 
not be the sole regulator of anterior/posterior specification. Nevertheless, it is clear 
that GDF-1 1 plays an important role as a global regulator of axial patterning and that 
further study of this molecule will lead to important new insights into how positional 
identity along the anterior/posterior axis is established in the vertebrate embryo. 

Similar phenotypes are expected in GDF-8 knock-out animals. For example, 
GDF-8 knock-out animals are expected to have increased number of ribs, kidney 
defects and anatomical differences when compared to wild type. 

Although the invention has been described with reference to the above examples, 
it will be understood that modifications and variations are encompassed within the spirit 
and scope of the invention. Accordingly, the invention is limited only by the following 
claims. 
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What is claimed is: 

1 . An isolated polynucleotide encoding a promyostatin polypeptide or a 
peptide .portion thereof, or a polynucleotide complementary thereto. 

5 2. The polynucleotide of claim 1, wherein the promyostatin polypeptide is a 

vertebrate promyostatin polypeptide. 

3. The polynucleotide of claim 2, wherein the vertebrate promyostatin 
polypeptide is a mammalian promyostatin polypeptide. 

10 

4. The polynucleotide of claim 3, wherein the mammalian promyostatin 
polypeptide is selected from 

a human promyostatin polypeptide comprising an amino acid sequence 
as set forth in SEQ ID NO: 2; 
15 a murine promyostatin polypeptide comprising an amino acid sequence 

as set forth in SEQ ID NO: 4; 

a rat promyostatin polypeptide comprising an amino acid sequence as 
set forth in SEQ ID NO: 6; 

a baboon promyostatin polypeptide comprising an amino acid 
20 sequence as set forth in SEQ ID NO: 1 0; 

a bovine promyostatin polypeptide comprising an amino acid sequence 
as set forth in SEQ ID NO: 12; 

a porcine promyostatin polypeptide comprising an amino acid 
sequence as set forth in SEQ ID NO: 14; and 
25 an ovine promyostatin polypeptide comprising an amino acid sequence 

as set forth in SEQ ID NO: 16. 

5. The polynucleotide of claim 2, wherein the vertebrate promyostatin 
polypeptide is an avian promyostatin polypeptide. 

30 
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6. The polynucleotide of claim 5, wherein the avian promyostatin polypeptide 
is selected from 

a chicken promyostatin polypeptide comprising an amino acid 
sequence as set forth in SEQ ID NO: 8; and 

a turkey promyostatin polypeptide comprising an amino acid sequence 
as set forth in SEQ ID NO: 1 8. 

7. The polynucleotide of claim 2, wherein the vertebrate promyostatin 
polypeptide is a piscine promyostatin polypeptide. 

8. The polynucleotide of claim 7, wherein the piscine promyostatin 
polypeptide is selected from 

a zebrafish promyostatin polypeptide comprising an amino acid 
sequence as set forth in SEQ ID NO: 20; 

a salmon allele 1 promyostatin polypeptide comprising an amino acid 
sequence as set forth in SEQ ID NO: 27; and 

a salmon allele 2 promyostatin polypeptide comprising an amino acid 
sequence as set forth in SEQ ID NO: 29. 

9. A vector, comprising a polynucleotide of claim 1. 

•10. The vector of claim 9, wherein said vector is an expression vector. 

1 1 . A host cell containing a polynucleotide of claim 1 . 

12. The host cell of claim 11, wherein said polynucleotide is in a vector. 

13. An oligonucleotide, comprising at least about 15 nucleotides that 
specifically hybridize to a polynucleotide encoding a promyostatin polypeptide or a 
peptide portion thereof, or to a polynucleotide complementary thereto. 
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14. The oligonucleotide of claim 13, wherein said promyostatin polypeptide 
or a peptide portion thereof is selected from SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 
NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID 
NO: 16, SEQ ID NO: 18, SEQ ID NO 20, SEQ ID NO: 27 and SEQ ID NO: 29. 

15. The oligonucleotide of claim 13, wherein said polynucleotide encoding 
promyostatin polypeptide or a peptide portion thereof is selected from SEQ ID NO: 1 , 
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 1 1, SEQ 
ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO 19, SEQ ID NO: 26 and 
SEQ ID NO: 28. 

16. The oligonucleotide of claim 13, wherein said oligonucleotide is an 
antisense nucleic acid molecule. 

17. A vector, comprising an oligonucleotide of claim 13. 

18. A host cell containing the vector of claim 17. 

19. An isolated polynucleotide encoding a mature myostatin peptide, or a 
polynucleotide complementary thereto. 

20. The polynucleotide of claim 19, wherein the myostatin peptide is a 
vertebrate myostatin peptide. 

21 . The polynucleotide of claim 20, wherein the vertebrate myostatin peptide 
is a mammalian myostatin peptide. 

22. The polynucleotide of claim 21, wherein the mammalian myostatin 
peptide comprises an amino acid sequence selected from 

amino acid residues about 267 to 374 as set forth in SEQ ID NO: 2; 
amino acid residues about 268 to 375 as set forth in SEQ ID NO: 4; 
amino acid residues about 268 to 375 as set forth in SEQ ID NO: 6; 
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amino acid residues about 267 to 374 as set forth in SEQ ID NO: 10; 
amino acid residues about 267 to 374 as set forth in SEQ ID NO: 12; 
amino acid residues about 267 to 374 as set forth in SEQ ID NO: 14; 

and 

amino acid residues about 267 to 374 as set forth in SEQ ID NO: 16. 

23. The polynucleotide of claim 20, wherein the vertebrate myostatin peptide 
is an avian myostatin peptide. 

24. The polynucleotide of claim 23, wherein the avian myostatin peptide 
comprises an amino acid sequence selected from 

amino acid residues about 267 to 374 as set forth in SEQ ID NO: 8; 

and 

amino acid residues about 267 to 374 as set forth in SEQ ID NO: 18. 

25. The polynucleotide of claim 20, wherein the vertebrate myostatin peptide 
is a piscine myostatin peptide. 

26. The polynucleotide of claim 25, wherein the piscine myostatin peptide 
comprises an amino acid sequence selected from 

amino acid residues about 267 to 374 as set forth in SEQ ID NO: 20; 
amino acid residues about 49 to 1 57 of SEQ ID NO: 27; and 
amino acid residues about 28 to 136 of SEQ ID NO: 29. 

27. A vector, comprising a polynucleotide of claim 19. 

28. The vector of claim 27, wherein said vector is a viral vector. 

29. A host cell containing the vector of claim 27. 
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POLYNUCLEOTIDES ENCODING PROMYOSTATIN POLYPEPTIDES 

ABSTRACT OF THE INVENTION 

The present invention provides isolated polynucleotides encoding 
promyostatin polypeptides or a peptide portion thereof, polynucleotides 
complementary thereto, and oligonucleotides that can specifically hybridize to such 
polynucleotides. The present invention also provides an isolated polynucleotide 
encoding a mature myostatin peptide. 
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With respect to the Application, I hereby claim the benefit under 35 U.S.C. 
Section 1 19(e) of any United States provisional application(s) listed below: 



(Application Serial No.) (Filing Date) 

With respect to the Application, I hereby claim the benefit under 35 U.S.C. 
Section 1 19 of any foreign application(s) for patent or inventor's certificate or of any PCT 
international application(s) designating at least one country other than the United States of 
America listed below and have also identified below any foreign application for patent or 
inventor's certificate or any PCT international application(s) designating at least one country 
other than the United States of America filed by me on the same subject matter having a filing 
date before that of the application(s) of which priority is claimed: 

COUNTRY APPLICATION NO. FILING DATE PRIORITY 

CLAIMED 



With respect to the Application, I hereby claim the benefit under 35 U.S.C. 

Section 120 of any United States application(s), or Section 365(c) of any PCT International 

application designating the United States, listed below and, insofar as the subject matter of each 

of the claims of the application is not disclosed in the prior United States or PCT International 

application in the manner provided by the first paragraph of 35 U.S.C. Section 1 12, 1 

acknowledge the duty to disclose to the United States Patent and Trademark Office all 

information known to me to be material to patentability of the subject matter of the Application 

as defined in Title 37, C.F.R., Section 1.56 which became available between the filing date of the 

prior application and the national or PCT International filing date of the Application: 

09/628.112 7/27/00 Pending 

(Application Serial No.) (Filing Date) (Status) 

(patented, pending, abandoned) 

09/124.180 7/28/98 Pending 

(Application Serial No.) (Filing Date) (Status) 

(patented, pending, abandoned) 
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1 

SEQUENCE LISTING 



<110> Johns Hopkins University School of Medicine 
Se-Jin, Lee 
Al exandra , Mc Phe r ron 

<12 0> POLYNUCLEOTIDES ENCODING PROM YO S TAT I N POLYPEPTIDES 

<130> JHU1120-15 

<14 0> US # UNAS SIGNED 
<141> 2000-11-07 

<150> 09/628,112 
<151> 2000-07-27 

<150> 09/124,180 
<151> 1998-07-28 

<150> 09/019,070 
<151> 1998-02-05 

<150> 08/862,445 
<151> 1997-05-23 

<150> 08/847,910 
<151> 1997-04-28 

<150> 08/795,071 
<151> 1997-02-05 

<150> 08/525,596 
<151> 1995-10-26 

<160> 29 

<170> Patentln version 3.0 

<210> 1 

<211> 2743 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (59) . . (1183) 

<400> 1 

aagaaaagta aaaggaagaa acaagaacaa gaaaaaagat tatattgatt ttaaaatc 5 8 

atg caa aaa ctg caa etc tgt gtt tat att tac ctg ttt atg ctg att 106 
Met Gin Lys Leu Gin Leu Cys Val Tyr lie Tyr Leu Phe Met Leu lie 
15 10 15 

gtt get ggt cca gtg gat eta aat gag aac agt gag caa aaa gaa aat 154 
Val Ala Gly Pro Val Asp Leu Asn Glu Asn Ser Glu Gin Lys Glu Asn 
20 25 30 



gtg gaa aaa gag ggg ctg tgt aat gca tgt act tgg aga caa aac act 2 02 

Val Glu Lys Glu Gly Leu Cys Asn Ala Cys Thr Trp Arg Gin Asn Thr 
35 40 45 
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aaa tct tea aga ata gaa gec att aag ata caa ate etc agt aaa ctt 
Lys Ser Ser Arg lie Glu Ala lie Lys lie Gin lie Leu Ser Lys Leu 



cgt ctg gaa aca get cct aac ate age aaa gat gtt ata aga caa ctt 
Arg Leu Glu Thr Ala Pro Asn He Ser Lys Asp Val He Arg Gin Leu 



tta ccc aaa get cct cca etc egg gaa ctg att gat cag tat gat gtc 
Leu Pro Lys Ala Pro Pro Leu Arg Glu Leu He Asp Gin Tyr Asp Val 



cag agg gat gac age age gat ggc tct ttg gaa gat gac gat tat cac 

Gin Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His 
100 105 110 

get aca acg gaa aca ate att ace atg cct aca gag tct gat ttt eta 

Ala Thr Thr Glu Thr He He Thr Met Pro Thr Glu Ser Asp Phe Leu 
115 120 125 

atg caa gtg gat gga aaa ccc aaa tgt tgc ttc ttt aaa ttt age tct 

Met Gin Val Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser 
130 135 140 

aaa ata caa tac aat aaa gta gta aag gec caa eta tgg ata tat ttg 

Lys He Gin Tyr Asn Lys Val Val Lys Ala Gin Leu Trp He Tyr Leu 

145 150 155 160 

aga ccc gtc gag act cct aca aca gtg ttt gtg caa ate ctg aga etc 

Arg Pro Val Glu Thr Pro Thr Thr Val Phe Val Gin He Leu Arg Leu 
165 170 175 

ate aaa cct atg aaa gac ggt aca agg tat act gga ate cga tct ctg 

He Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly He Arg Ser Leu 
180 185 190 

aaa ctt gac atg aac cca ggc act ggt att tgg cag age att gat gtg 

Lys Leu Asp Met Asn Pro Gly Thr Gly He Trp Gin Ser He Asp Val 
195 200 205 

aag aca gtg ttg caa aat tgg etc aaa caa cct gaa tec aac tta ggc 

Lys Thr Val Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly 
210 215 220 

att gaa ata aaa get tta gat gag aat ggt cat gat ctt get gta ace 

He Glu He Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr 

225 230 235 240 

ttc cca gga cca gga gaa gat ggg ctg aat ccg ttt tta gag gtc aag 

Phe Pro Gly Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Lys 
245 250 255 

gta aca gac aca cca aaa aga tec aga agg gat ttt ggt ctt gac tgt 

Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys 
260 265 270 

gat gag cac tea aca gaa tea cga tgc tgt cgt tac cct eta act gtg 

Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val 
275 280 285 
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gat ttt gaa get ttt gga tgg gat tgg att ate get cct aaa aga tat 

Asp Phe Glu Ala Phe Gly Trp Asp Trp lie lie Ala Pro Lys Arg Tyr 
290 295 300 

aag gec aat tac tgc tct gga gag tgt gaa ttt gta ttt tta caa aaa 
Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys 
305 310 315 320 

tat cct cat act cat ctg gta cac caa gca aac ccc aga ggt tea gca 
Tyr Pro His Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala 
325 330 335 

ggc cct tgc tgt act ccc aca aag atg tct cca att aat atg eta tat 
Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu Tyr 
340 345 350 

ttt aat ggc aaa gaa caa ata ata tat ggg aaa att cca gcg atg gta 
Phe Asn Gly Lys Glu Gin lie lie Tyr Gly Lys lie Pro Ala Met Val 
355 360 365 

gta gac cgc tgt ggg tgc tea tgagatttat attaagegtt cataacttcc 
Val Asp Arg Cys Gly Cys Ser 
370 375 



taaaacatgg 


aaggttttcc 


cctcaacaat 


tttgaagctg 


tgaaattaag 


taccacaggc 


1273 


tataggecta 


gagtatgeta 


cagtcactta 


agcataagct 


acagtatgta 


aactaaaagg 


1333 


gggaatatat 


gcaatggttg 


gcatttaacc 


atccaaacaa 


atcatacaag 


aaagttttat 


1393 


gatttccaga 


gtttttgagc 


tagaaggaga 


tcaaattaca 


tttatgttcc 


tat at at tac 




aacateggeg 


aggaaatgaa 


agegattetc 


cttgagttct 


gatgaattaa 


aggagtatgc 


1513 


tttaaagtct 


atttctttaa 


agttttgttt 


aatatttaca 


gaaaaatcca 


catacagtat 


1573 


tggtaaaatg 


caggattgtt 


atataccatc 


attcgaatca 


tccttaaaca 


cttgaattta 


1633 


tattgtatgg 


tagtatactt 


ggtaagataa 


aattccacaa 


aaatagggat 


ggtgcagcat 


1693 


atgeaattte 


cattcctatt 


ataattgaca 


cagtacatta 


acaatccatg 


ccaacggtgc 


1753 


taatacgata 


ggctgaatgt 


ctgaggctac 


caggtttatc 


acataaaaaa 


cattcagtaa 


1813 


aatagtaagt 


ttctcttttc 


ttcaggtgca 


ttttcctaca 


cctccaaatg 


aggaatggat 


1873 


tttctttaat 


gtaagaagaa 


tcatttttct 


agaggttggc 


tttcaattct 


gtagcatact 


1933 


tggagaaact 


gcattatctt 


aaaaggcagt 


caaatggtgt 


ttgtttttat 


caaaatgtca 


1993 


aaataacata 


cttggagaag 


tatgtaattt 


tgtctttgga 


aaattacaac 


actgectttg 


2053 


caacactgca 


gtttttatgg 


taaaataata 


gaaatgatcg 


actctatcaa 


tattgtataa 


2113 


aaagactgaa 


acaatgeatt 


tatataatat 


gtatacaata 


ttgttttgta 


aataagtgtc 


2173 


tcctttttta 


tttactttgg 


tatattttta 


cactaaggac 


atttcaaatt 


aagtactaag 


2233 


gcacaaagac 


atgtcatgca 


tcacagaaaa 


gcaactactt 


atatttcaga 


gcaaattagc 


2293 


agattaaata 


gtggtcttaa 


aactccatat 


gttaatgatt 


agatggttat 


attacaatca 


2353 
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ttttatattt ttttacatga ttaacattca cttatggatt catgatggct gtataaagtg 2413 

aatttgaaat ttcaatggtt tactgtcatt gtgtttaaat ctcaacgttc cattatttta 2473 

atacttgcaa aaacattact aagtatacca aaataattga ctctattatc tgaaatgaag 2533 

aataaactga tgctatctca acaataactg ttacttttat tttataattt gataatgaat 2593 

atatttctgc atttatttac ttctgttttg taaattggga ttttgttaat caaatttatt 2653 

gtactatgac taaatgaaat tatttcttac atctaatttg tagaaacagt ataagttata 2713 

ttaaagtgtt ttcacatttt tttgaaagac 2743 

<210> 2 

<211> 375 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Gin Lys Leu Gin Leu Cys Val Tyr lie Tyr Leu Phe Met Leu lie 
1 5 10 15 

Val Ala Gly Pro Val Asp Leu Asn Glu Asn Ser Glu Gin Lys Glu Asn 
20 25 30 

Val Glu Lys Glu Gly Leu Cys Asn Ala Cys Thr Trp Arg Gin Asn Thr 
35 40 45 

Lys Ser Ser Arg lie Glu Ala lie Lys lie Gin lie Leu Ser Lys Leu 
50 55 60 

Arg Leu Glu Thr Ala Pro Asn He Ser Lys Asp Val He Arg Gin Leu 
65 70 75 80 

Leu Pro Lys Ala Pro Pro Leu Arg Glu Leu He Asp Gin Tyr Asp Val 



Gin Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His 
100 105 110 



Ala Thr Thr Glu Thr He He Thr Met Pro Thr Glu Ser Asp Phe Leu 
115 120 125 



Met Gin Val Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser 
130 135 140 



Lys He Gin Tyr Asn Lys Val Val Lys Ala Gin Leu Trp He Tyr Leu 
145 150 155 160 
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Arg Pro Val Glu Thr Pro Thr Thr Val Phe Val Gin lie Leu Arg Leu 
165 170 175 



lie Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly lie Arg Ser Leu 
180 185 190 



Lys Leu Asp Met Asn Pro Gly Thr Gly lie Trp Gin Ser lie Asp Val 
195 200 205 



Lys Thr Val Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly 
210 215 220 



lie Glu lie Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr 
225 230 235 240 



Phe Pro Gly Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Lys 
245 250 255 



Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys 
260 265 270 



Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val 
275 280 285 



Asp Phe Glu Ala Phe Gly Trp Asp Trp lie lie Ala Pro Lys Arg Tyr 
290 295 300 



Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys 
305 310 315 320 



Tyr Pro His Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala 
325 330 335 



Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu Tyr 
340 345 350 



Phe Asn Gly Lys Glu Gin lie lie Tyr Gly Lys lie Pro Ala Met Val 
355 360 365 



Val Asp Arg Cys Gly Cys Ser 
370 375 



<210> 3 
<211> 2676 
<212> DNA 



<213> Mus mus cuius 



<220> 

<221> CDS 

<222> (104) . . (1231) 



<400> 3 

gtctctcgga cggtacatgc actaatattt cacttggcat tactcaaaag caaaaagaag 

aaataagaac aagggaaaaa aaaagattgt gctgattttt aaa atg atg caa aaa 

Met Met Gin Lys 



ctg caa atg tat gtt tat att tac ctg ttc atg ctg att get get ggc 
Leu Gin Met Tyr Val Tyr He Tyr Leu Phe Met Leu He Ala Ala Gly 
5 10 15 20 

cca gtg gat eta aat gag ggc agt gag aga gaa gaa aat gtg gaa aaa 
Pro Val Asp Leu Asn Glu Gly Ser Glu Arg Glu Glu Asn Val Glu Lys 
25 30 35 

gag ggg ctg tgt aat gca tgt gcg tgg aga caa aac acg agg tac tec 
Glu Gly Leu Cys Asn Ala Cys Ala Trp Arg Gin Asn Thr Arg Tyr Ser 
40 45 50 

aga ata gaa gec ata aaa att caa ate etc agt aag ctg cgc ctg gaa 
Arg He Glu Ala He Lys He Gin He Leu Ser Lys Leu Arg Leu Glu 
55 60 65 

aca get cct aac ate age aaa gat get ata aga caa ctt ctg cca aga 
Thr Ala Pro Asn He Ser Lys Asp Ala He Arg Gin Leu Leu Pro Arg 



gcg cct cca etc egg gaa ctg ate gat cag tac gac gtc cag agg gat 
Ala Pro Pro Leu Arg Glu Leu He Asp Gin Tyr Asp Val Gin Arg Asp 
85 90 95 100 

gac age agt gat ggc tct ttg gaa gat gac gat tat cac get ace acg 
Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His Ala Thr Thr 
105 HO H5 

gaa aca ate att acc atg cct aca gag tct gac ttt eta atg caa gcg 
Glu Thr He He Thr Met Pro Thr Glu Ser Asp Phe Leu Met Gin Ala 
120 125 130 

gat ggc aag ccc aaa tgt tgc ttt ttt aaa ttt age tct aaa ata cag 
Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser Lys He Gin 
135 140 145 

tac aac aaa gta gta aaa gee caa ctg tgg ata tat etc aga ccc gtc 
Tyr Asn Lys Val Val Lys Ala Gin Leu Trp He Tyr Leu Arg Pro Val 
150 155 160 

aag act cct aca aca gtg ttt gtg caa ate ctg aga etc ate aaa ccc 
Lys Thr Pro Thr Thr Val Phe Val Gin He Leu Arg Leu lie Lys Pro 
165 170 175 180 

atg aaa gac ggt aca agg tat act gga ate cga tct ctg aaa ctt gac 
Met Lys Asp Gly Thr Arg Tyr Thr Gly He Arg Ser Leu Lys Leu Asp 
185 190 195 
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atg age cca ggc act ggt att tgg cag agt att gat gtg aag aca gtg 73 9 

Met Ser Pro Gly Thr Gly He Trp Gin Ser He Asp Val Lys Thr Val 
200 205 210 

ttg caa aat tgg etc aaa cag cct gaa tec aac tta ggc att gaa ate 787 
Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly He Glu He 
215 220 225 

aaa get ttg gat gag aat ggc cat gat ctt get gta acc ttc cca gga 835 
Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr Phe Pro Gly 
230 235 240 

cca gga gaa gat ggg ctg aat ccc ttt tta gaa gtc aag gtg aca gac 883 
Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Lys Val Thr Asp 
245 250 255 260 

aca ccc aag agg tec egg aga gac ttt ggg ctt gac tgc gat gag cac 931 
Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His 
265 270 275 

tec acg gaa tec egg tgc tgc cgc tac ccc etc acg gtc gat ttt gaa 979 
Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu 
280 285 290 

gee ttt gga tgg gac tgg att ate gca ccc aaa aga tat aag gec aat 1027 
Ala Phe Gly Trp Asp Trp He He Ala Pro Lys Arg Tyr Lys Ala Asn 
295 300 305 

tac tgc tea gga gag tgt gaa ttt gtg ttt tta caa aaa tat ccg cat 1075 
Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys Tyr Pro His 
310 315 320 

act cat ctt gtg cac caa gca aac ccc aga ggc tea gca ggc cct tgc 1123 
Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys 
325 330 335 340 

tgc act ccg aca aaa atg tct ccc att aat atg eta tat ttt aat ggc 1171 
Cys Thr Pro Thr Lys Met Ser Pro He Asn Met Leu Tyr Phe Asn Gly 
345 350 355 

aaa gaa caa ata ata tat ggg aaa att cca gee atg gta gta gac cgc 1219 
Lys Glu Gin He He Tyr Gly Lys He Pro Ala Met Val Val Asp Arg 
360 365 370 

tgt ggg tgc tea tgagctttgc attaggttag aaacttccca agtcatggaa 12 71 
Cys Gly Cys Ser 
375 

ggtcttcccc tcaatttcga aactgtgaat tcaagcacca caggctgtag gecttgagta 1331 

tgctctagta aegtaagcac aagctacagt gtatgaacta aaagagagaa tagatgeaat 13 91 

ggttggcatt caaccaccaa aataaaccat actataggat gttgtatgat ttccagagtt 1451 

tttgaaatag atggagatca aattacattt atgtccatat atgtatatta caactacaat 1511 

ctaggcaagg aagtgagagc acatcttgtg gtctgetgag ttaggagggt atgattaaaa 15 71 

ggtaaagtct tatttcctaa cagtttcact taatatttac agaagaatct atatgtagee 1631 

tttgtaaagt gtaggattgt tatcatttaa aaacatcatg tacacttata tttgtattgt 1691 
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atacttggta 


agataaaatt 


ccacaaagta 


ggaatggggc 


ctcacataca 


cattgccatt 


1751 


cctattataa 


ttggacaatc 


caccacggtg 


ctaatgcagt 


gctgaatggc 


tcctactgga 


1811 


cctctcgata 


gaacactcta 


caaagtacga 


gtctctctct 


cccttccagg 


tgcatctcca 


1871 


cacacacagc 


actaagtgtt 


caatgcattt 


tctttaagga 


aagaagaatc 


tttttttcta 


1931 


gaggtcaact 


ttcagtcaac 


tctagcacag 


cgggagtgac 


tgctgcatct 


taaaaggcag 


1991 


ccaaacagta 


ttcatttttt 


aatctaaatt 


tcaaaatcac 


tgtctgcctt 


tatcacatgg 


2051 


caattttgtg 


gtaaaataat 


ggaaatgact 


ggttctatca 


atattgtata 


aaagactctg 


2111 


aaacaattac 


atttatataa 


tatgtataca 


atattgtttt 


gtaaataagt 


gtctcctttt 


2171 


atatttactt 


tggtatattt 


ttacactaat 


gaaatttcaa 


atcattaaag 


tacaaagaca 


2231 


tgtcatgtat 


cacaaaaaag 


gtgactgctt 


ctatttcaga 


gtgaattagc 


agattcaata 


2291 


gtggtcttaa 


aactctgtat 


gttaagatta 


gaaggttata 


ttacaatcaa 


tttatgtatt 


2351 


ttttacatta 


tcaacttatg 


gtttcatggt 


ggctgtatct 


atgaatgtgg 


ctcccagtca 


2411 


aatttcaatg 


ccccaccatt 


ttaaaaatta 


caagcattac 


taaacatacc 


aacatgtatc 


2471 


taaagaaata 


caaatatggt 


atctcaataa 


cagctacttt 


tttattttat 


aatttgacaa 


2531 


tgaatacatt 


tcttttattt 


acttcagttt 


tataaattgg 


aactttgttt 


atcaaatgta 


2591 


ttgtactcat 


agctaaatga 


aattatttct 


tacataaaaa 


tgtgtagaaa 


ctataaatta 


2651 


aagtgttttc 


acatttttga 


aaggc 








2676 



<210> 4 

<211> 376 

<212> PRT 

<213> Mus musculus 

<400> 4 

Met Met Gin Lys Leu Gin Met Tyr Val Tyr He Tyr Leu Phe Met Leu 
15 10 15 

He Ala Ala Gly Pro Val Asp Leu Asn Glu Gly Ser Glu Arg Glu Glu 
20 25 30 

Asn Val Glu Lys Glu Gly Leu Cys Asn Ala Cys Ala Trp Arg Gin Asn 
35 40 45 

Thr Arg Tyr Ser Arg He Glu Ala lie Lys He Gin He Leu Ser Lys 
50 55 60 

Leu Arg Leu Glu Thr Ala Pro Asn He Ser Lys Asp Ala He Arg Gin 
65 70 75 80 



Leu Leu Pro Arg Ala Pro Pro Leu Arg Glu Leu lie Asp Gin Tyr Asp 
85 90 95 



Val Gin Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr 
100 105 110 



His Ala Thr Thr Glu Thr lie lie Thr Met Pro Thr Glu Ser Asp Phe 
115 120 125 



Leu Met Gin Ala Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser 
130 135 140 



Ser Lys lie Gin Tyr Asn Lys Val Val Lys Ala Gin Leu Trp lie Tyr 
145 150 155 160 



Leu Arg Pro Val Lys Thr Pro Thr Thr Val Phe Val Gin lie Leu Arg 
165 170 175 



Leu lie Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly lie Arg Ser 
180 185 190 



Leu Lys Leu Asp Met Ser Pro Gly Thr Gly lie Trp Gin Ser lie Asp 
195 200 205 



Val Lys Thr Val Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu 
210 215 220 



Gly lie Glu He Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val 
225 230 235 240 



Thr Phe Pro Gly Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val 
245 250 255 



Lys Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp 
260 265 270 



Cys Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr 
275 280 285 



Val Asp Phe Glu Ala Phe Gly Trp Asp Trp He He Ala Pro Lys Arg 
290 295 300 



Tyr Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin 
305 310 315 320 



Lys Tyr Pro His Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser 
325 330 335 



Ala Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu 
340 345 350 



Tyr Phe Asn Gly Lys Glu Gin lie lie Tyr Gly Lys lie Pro Ala Met 
355 360 365 



Val Val Asp Arg Cys Gly Cys Ser 
370 375 



<210> 5 

<211> 1131 

<212> DNA 

<213> Rattus norvegicus 
<220> 

<221> CDS 

<222> (1) . . (1128) 

<400> 5 

atg att caa aaa ccg caa atg tat gtt tat att tac ctg ttt gtg ctg 
Met lie Gin Lys Pro Gin Met Tyr Val Tyr lie Tyr Leu Phe Val Leu 



att get get ggc cca gtg gat eta aat gag gac agt gag aga gag gcg 
lie Ala Ala Gly Pro Val Asp Leu Asn Glu Asp Ser Glu Arg Glu Ala 



aat gtg gaa aaa gag ggg ctg tgt aat gcg tgt gcg tgg aga caa aac 
Asn Val Glu Lys Glu Gly Leu Cys Asn Ala Cys Ala Trp Arg Gin Asn 



aca agg tac tec aga ata gaa gee ata aaa att caa ate etc agt aaa 
Thr Arg Tyr Ser Arg lie Glu Ala lie Lys lie Gin lie Leu Ser Lys 



etc cgc ctg gaa aca gcg cct aac ate age aaa gat get ata aga caa 
Leu Arg Leu Glu Thr Ala Pro Asn lie Ser Lys Asp Ala lie Arg Gin 



ctt ctg ccc aga gcg cct cca etc egg gaa ctg ate gat cag tac gac 
Leu Leu Pro Arg Ala Pro Pro Leu Arg Glu Leu lie Asp Gin Tyr Asp 



gtc cag agg gat gac age agt gac ggc tct ttg gaa gat gac gat tat 

Val Gin Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr 
100 105 110 

cac get acc acg gaa aca ate att acc atg cct ace gag tct gac ttt 

His Ala Thr Thr Glu Thr lie lie Thr Met Pro Thr Glu Ser Asp Phe 
115 120 125 



eta atg caa gcg gat gga aag ccc aaa tgt tgc ttt ttt aaa ttt age 432 
Leu Met Gin Ala Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser 



tct aaa ata cag tac aac aaa gtg gta aag gcc cag ctg tgg ata tat 
Ser Lys lie Gin Tyr Asn Lys Val Val Lys Ala Gin Leu Trp lie Tyr 
145 150 155 ISO 

ctg aga gcc gtc aag act cct aca aca gtg ttt gtg caa ate ctg aga 
Leu Arg Ala Val Lys Thr Pro Thr Thr Val Phe Val Gin lie Leu Arg 
165 170 175 

etc ate aaa ccc atg aaa gac ggt aca agg tat acc gga ate cga tct 
Leu lie Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly lie Arg Ser 
180 185 190 

ctg aaa ctt gac atg age cca ggc act ggt att tgg cag agt att gat 
Leu Lys Leu Asp Met Ser Pro Gly Thr Gly lie Trp Gin Ser lie Asp 
195 200 205 

gtg aag aca gtg ttg caa aat tgg etc aaa cag cct gaa tec aac tta 
Val Lys Thr Val Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu 
210 215 220 

ggc att gaa ate aaa get ttg gat gag aat ggg cat gat ctt get gta 
Gly lie Glu lie Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val 
225 230 235 240 

acc ttc cca gga cca gga gaa gat ggg ctg aat ccc ttt tta gaa gtc 
Thr Phe Pro Gly Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val 
245 250 255 

aaa gta aca gac aca ccc aag agg tec egg aga gac ttt ggg ctt gac 
Lys Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp 
260 265 270 

tgc gat gaa cac tec acg gaa teg egg tgc tgt cgc tac ccc etc acg 
Cys Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr 
275 280 285 

gtc gat ttc gaa gcc ttt gga tgg gac tgg att att gca ccc aaa aga 
Val Asp Phe Glu Ala Phe Gly Trp Asp Trp lie lie Ala Pro Lys Arg 
290 295 300 

tat aag get aat tac tgc tct gga gag tgt gaa ttt gtg ttc tta caa 
Tyr Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin 
305 310 315 320 

aaa tat ccg cat act cat ctt gtg cac caa gca aac ccc aga ggc teg 
Lys Tyr Pro His Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser 
325 330 335 

gca ggc cct tgc tgc acg cca aca aaa atg tct ccc att aat atg eta 
Ala Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu 
340 345 350 

tat ttt aat ggc aaa gaa caa ata ata tat ggg aaa att cca gcc atg 
Tyr Phe Asn Gly Lys Glu Gin lie lie Tyr Gly Lys lie Pro Ala Met 
355 360 365 

gta gta gac egg tgt ggg tgc teg tga 
Val Val Asp Arg Cys Gly Cys Ser 
370 375 



<210> 6 

<211> 376 

<212> PRT 

<213> Rattus norvegicus 

<400> 6 

Met He Gin Lys Pro Gin Met Tyr Val Tyr He Tyr Leu Phe Val Leu 
1 5 10 15 

He Ala Ala Gly Pro Val Asp Leu Asn Glu Asp Ser Glu Arg Glu Ala 
20 25 30 

Asn Val Glu Lys Glu Gly Leu Cys Asn Ala Cys Ala Trp Arg Gin Asn 
35 40 45 

Thr Arg Tyr Ser Arg He Glu Ala He Lys He Gin He Leu Ser Lys 
50 55 60 

Leu Arg Leu Glu Thr Ala Pro Asn He Ser Lys Asp Ala He Arg Gin 
65 70 75 80 

Leu Leu Pro Arg Ala Pro Pro Leu Arg Glu Leu He Asp Gin Tyr Asp 



Val Gin Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr 
100 105 HO 

His Ala Thr Thr Glu Thr He He Thr Met Pro Thr Glu Ser Asp Phe 
115 120 125 



Leu Met Gin Ala Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser 
130 135 140 



Ser Lys He Gin Tyr Asn Lys Val Val Lys Ala Gin Leu Trp He Tyr 
145 150 155 160 



Leu Arg Ala Val Lys Thr Pro Thr Thr Val Phe Val Gin He Leu Arg 
165 170 175 



Leu He Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly He Arg Ser 
180 185 190 



Leu Lys Leu Asp Met Ser Pro Gly Thr Gly He Trp Gin Ser He Asp 
195 200 205 
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Val Lys Thr Val Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu 
210 215 220 



Gly lie Glu lie Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val 
225 230 235 240 



Thr Phe Pro Gly Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val 
245 250 255 



Lys Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp 
260 265 270 



Cys Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr 
275 280 285 



Val Asp Phe Glu Ala Phe Gly Trp Asp Trp lie lie Ala Pro Lys Arg 
290 295 300 



Tyr Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin 
305 310 315 320 



Lys Tyr Pro His Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser 
325 330 335 



Ala Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu 
340 345 350 



Tyr Phe Asn Gly Lys Glu Gin lie lie Tyr Gly Lys lie Pro Ala Met 
355 360 365 



Val Val Asp Arg Cys Gly Cys Ser 
370 375 



<210> 7 

<211> 1128 

<212> DNA 

<213> Gallus gallus 

<220> 

<221> CDS 

<222> (1) . . (1125) 

<400> 7 

atg caa aag ctg gca gtc tat gtt tat att tac ctg ttc atg cag ate 
Met Gin Lys Leu Ala Val Tyr Val Tyr lie Tyr Leu Phe Met Gin lie 



gcg gtt gat ccg gtg get ctg gat ggc agt agt cag ccc aca gag aac 
Ala Val Asp Pro Val Ala Leu Asp Gly Ser Ser Gin Pro Thr Glu Asn 



get gaa aaa gac gga ctg tgc aat get tgt acg tgg aga cag aat aca 
Ala Glu Lys Asp Gly Leu Cys Asn Ala Cys Thr Trp Arg Gin Asn Thr 



aaa tec tec aga ata gaa gee ata aaa att caa ate etc age aaa ctg 
Lys Ser Ser Arg lie Glu Ala lie Lys lie Gin lie Leu Ser Lys Leu 



cgc ctg gaa caa gca cct aac att age agg gac gtt att aag cag ctt 
Arg Leu Glu Gin Ala Pro Asn lie Ser Arg Asp Val lie Lys Gin Leu 



tta ccc aaa get cct cca ctg cag gaa ctg att gat cag tat gat gtc 
Leu Pro Lys Ala Pro Pro Leu Gin Glu Leu lie Asp Gin Tyr Asp Val 



cag agg gac gac agt age gat ggc tct ttg gaa gac gat gac tat cat 

Gin Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His 

100 105 110 

gee aca ace gag acg att ate aca atg cct acg gag tct gat ttt ctt 

Ala Thr Thr Glu Thr lie lie Thr Met Pro Thr Glu Ser Asp Phe Leu 

115 120 125 

gta caa atg gag gga aaa cca aaa tgt tgc ttc ttt aag ttt age tct 

Val Gin Met Glu Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser 
130 135 140 

aaa ata caa tat aac aaa gta gta aag gca caa tta tgg ata tac ttg 

Lys lie Gin Tyr Asn Lys Val Val Lys Ala Gin Leu Trp lie Tyr Leu 

145 150 155 160 

agg caa gtc caa aaa cct aca acg gtg ttt gtg cag ate ctg aga etc 

Arg Gin Val Gin Lys Pro Thr Thr Val Phe Val Gin lie Leu Arg Leu 

165 170 175 

att aag ccc atg aaa gac ggt aca aga tat act gga att cga tct ttg 

lie Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly lie Arg Ser Leu 

180 185 190 

aaa ctt gac atg aac cca ggc act ggt ate tgg cag agt att gat gtg 

Lys Leu Asp Met Asn Pro Gly Thr Gly lie Trp Gin Ser lie Asp Val 

195 200 205 

aag aca gtg ctg caa aat tgg etc aaa cag cct gaa tec aat tta ggc 

Lys Thr Val Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly 
210 215 220 

ate gaa ata aaa get ttt gat gag act gga cga gat ctt get gtc aca 

lie Glu lie Lys Ala Phe Asp Glu Thr Gly Arg Asp Leu Ala Val Thr 

225 230 235 240 

ttc cca gga cca gga gaa gat gga ttg aac cca ttt tta gag gtc aga 

Phe Pro Gly Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Arg 

245 250 255 

gtt aca gac aca ccg aaa egg tec cgc aga gat ttt ggc ctt gac tgt 

Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys 

260 265 270 
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gat gag cac tea acg gaa tec cga tgt tgt cgc tac ccg ctg aca gtg 864 
Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val 
275 280 285 

gat ttc gaa get ttt gga tgg gac tgg att ata gca cct aaa aga tac 912 
Asp Phe Glu Ala Phe Gly Trp Asp Trp lie lie Ala Pro Lys Arg Tyr 
290 295 300 

aaa gec aat tac tgc tec gga gaa tgc gaa ttt gtg ttt eta cag aaa 960 
Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys 
305 310 315 320 

tac ccg cac act cac ctg gta cac caa gca aat ccc aga ggc tea gca 1008 
Tyr Pro His Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala 
325 330 335 

ggc cct tgc tgc aca ccc acc aag atg tec cct ata aac atg ctg tat 1056 
Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu Tyr 
340 345 350 

ttc aat gga aaa gaa caa ata ata tat gga aag ata cca gee atg gtt 1104 
Phe Asn Gly Lys Glu Gin He He Tyr Gly Lys He Pro Ala Met Val 
355 360 365 

gta gat cgt tgc ggg tgc tea tga 112 8 

Val Asp Arg Cys Gly Cys Ser 
370 375 



<210> 8 

<211> 375 

<212> PRT 

<213> Gallus gallus 

<400> 8 

Met Gin Lys Leu Ala Val Tyr Val Tyr He Tyr Leu Phe Met Gin He 
15 10 15 

Ala Val Asp Pro Val Ala Leu Asp Gly Ser Ser Gin Pro Thr Glu Asn 
20 25 30 

Ala Glu Lys Asp Gly Leu Cys Asn Ala Cys Thr Trp Arg Gin Asn Thr 
35 40 45 

Lys Ser Ser Arg lie Glu Ala He Lys He Gin He Leu Ser Lys Leu 
50 55 60 

Arg Leu Glu Gin Ala Pro Asn He Ser Arg Asp Val He Lys Gin Leu 
65 70 75 80 

Leu Pro Lys Ala Pro Pro Leu Gin Glu Leu He Asp Gin Tyr Asp Val 



Gin Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His 



100 



16 

105 



110 



Ala Thr Thr Glu Thr lie lie Thr Met Pro Thr Glu Ser Asp Phe Leu 
115 120 125 



Val Gin Met Glu Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser 
130 135 140 



Lys lie Gin Tyr Asn Lys Val Val Lys Ala Gin Leu Trp lie Tyr Leu 
145 150 155 160 



Arg Gin Val Gin Lys Pro Thr Thr Val Phe Val Gin lie Leu Arg Leu 
165 170 175 



lie Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly lie Arg Ser Leu 
180 185 190 



Lys Leu Asp Met Asn Pro Gly Thr Gly He Trp Gin Ser He Asp Val 
195 200 205 



Lys Thr Val Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly 
210 215 220 



He Glu He Lys Ala Phe Asp Glu Thr Gly Arg Asp Leu Ala Val Thr 
225 230 235 240 



Phe Pro Gly Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Arg 
245 250 255 



Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys 



Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val 
275 280 285 



Asp Phe Glu Ala Phe Gly Trp Asp Trp He He Ala Pro Lys Arg Tyr 
290 295 300 



Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys 
305 310 315 320 



Tyr Pro His Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala 
325 330 335 



Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro He Asn Met Leu Tyr 
340 345 350 



17 



Phe Asn Gly Lys Glu Gin He He Tyr Gly Lys He Pro Ala 
355 360 365 



Val Asp Arg Cys Gly Cys Ser 
370 375 



<210> 9 
<211> 1128 



<212> DNA 
<213> Baboon 



<220> 

<221> CDS 

<222> (1) . . (1125) 



<400> 9 

atg caa aaa ctg caa etc tgt gtt tat att tac ctg ttt atg ctg att 
Met Gin Lys Leu Gin Leu Cys Val Tyr He Tyr Leu Phe Met Leu He 



gtt get ggt cca gtg gat eta aat gag aac agt gag caa aaa gaa aat 
Val Ala Gly Pro Val Asp Leu Asn Glu Asn Ser Glu Gin Lys Glu Asn 



gtg gaa aaa gag ggg ctg tgt aat gca tgt act tgg aga caa aac act 
Val Glu Lys Glu Gly Leu Cys Asn Ala Cys Thr Trp Arg Gin Asn Thr 



aaa tct tea aga ata gaa gee att aaa ata caa ate etc agt aaa ctt 
Lys Ser Ser Arg He Glu Ala He Lys He Gin He Leu Ser Lys Leu 



cgt ctg gaa aca get cct aac ate age aaa gat get ata aga caa ctt 
Arg Leu Glu Thr Ala Pro Asn He Ser Lys Asp Ala He Arg Gin Leu 



tta ccc aaa gcg cct cca etc egg gaa ctg att gat cag tat gat gtc 
Leu Pro Lys Ala Pro Pro Leu Arg Glu Leu He Asp Gin Tyr Asp Val 



cag agg gat gac age age gat ggc tct ttg gaa gat gac gat tat cac 
Gin Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His 
100 105 110 

get aca acg gaa aca ate att ace atg cct aca gag tct gat ttt tta 
Ala Thr Thr Glu Thr He He Thr Met Pro Thr Glu Ser Asp Phe Leu 
115 120 125 

atg caa gtg gat gga aaa ccc aaa tgt tgc ttc ttt aaa ttt age tct 
Met Gin Val Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser 
130 135 140 

aaa ata caa tac aat aaa gtg gta aag gec caa eta tgg ata tat ttg 
Lys He Gin Tyr Asn Lys Val Val Lys Ala Gin Leu Trp He Tyr Leu 
145 150 155 160 



aga ccc gtc gag act cct aca aca gtg ttt gtg caa ate ctg aga etc 



528 



18 

Arg Pro Val Glu Thr Pro Thr Thr Val Phe Val Gin He Leu Arg Leu 
165 170 175 



ate aaa cct atg aaa gac ggt aca agg tat act gga ate cga tct ctg 
He Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly He Arg Ser Leu 
180 185 190 

aaa ctt gac atg aac cca ggc act ggt att tgg cag age att gat gtg 
Lys Leu Asp Met Asn Pro Gly Thr Gly He Trp Gin Ser He Asp Val 
195 200 205 

aag aca gtg ttg caa aat tgg etc aaa caa cct gaa tec aac tta ggc 
Lys Thr Val Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly 
210 215 220 

att gaa ata aaa get tta gat gag aat ggt cat gat ctt get gta acc 
He Glu He Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr 
225 230 235 240 

ttc cca gga cca gga gaa gat ggg ctg aat ccc ttt tta gag gtc aag 
Phe Pro Gly Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Lys 
245 250 255 

gta aca gac aca ccc aaa aga tec aga agg gat ttt ggt ctt gac tgt 
Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys 
260 265 270 

gat gag cac tea aca gaa teg cga tgc tgt cgt tac cct eta act gtg 
Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val 
275 280 285 

gat ttt gaa get ctt gga tgg gat tgg att ate get cct aaa aga tat 
Asp Phe Glu Ala Leu Gly Trp Asp Trp He He Ala Pro Lys Arg Tyr 
290 295 300 

aag gee aat tac tgc tct gga gag tgt gaa ttt gta ttt tta caa aaa 
Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys 
305 310 315 320 

tat cct cat act cat ctg gta cac caa gca aac ccc aga ggt tea gca 
Tyr Pro His Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala 
325 330 335 

ggc cct tgc tgt act ccc aca aag atg tct cca att aat atg eta tat 
Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro He Asn Met Leu Tyr 
340 345 350 

ttt aat ggc aaa gaa caa ata ata tat ggg aaa att cca gee atg gta 
Phe Asn Gly Lys Glu Gin He He Tyr Gly Lys He Pro Ala Met Val 
355 360 365 

gta gac cgc tgc ggg tgc tea tga 
Val Asp Arg Cys Gly Cys Ser 
370 375 



<210> 10 

<211> 375 

<212> PRT 

<213> Baboon 



<400> 10 

Met Gin Lys Leu Gin Leu Cys Val Tyr He Tyr Leu Phe Met Leu He 
1 5 10 is 

Val Ala Gly Pro Val Asp Leu Asn Glu Asn Ser Glu Gin Lys Glu Asn 
20 25 30 

Val Glu Lys Glu Gly Leu Cys Asn Ala Cys Thr Trp Arg Gin Asn Thr 
35 40 45 

Lys Ser Ser Arg He Glu Ala He Lys He Gin He Leu Ser Lys Leu 
50 55 60 

Arg Leu Glu Thr Ala Pro Asn He Ser Lys Asp Ala He Arg Gin Leu 
65 70 75 80 

Leu Pro Lys Ala Pro Pro Leu Arg Glu Leu He Asp Gin Tyr Asp Val 



Gin Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His 
100 105 110 

Ala Thr Thr Glu Thr He He Thr Met Pro Thr Glu Ser Asp Phe Leu 
115 120 125 



Met Gin Val Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser 
130 135 140 



Lys He Gin Tyr Asn Lys Val Val Lys Ala Gin Leu Trp He Tyr Leu 
145 150 155 ifio 



Arg Pro Val Glu Thr Pro Thr Thr Val Phe Val Gin He Leu Arg Leu 
165 170 175 

He Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly He Arg Ser Leu 
180 185 190 

Lys Leu Asp Met Asn Pro Gly Thr Gly He Trp Gin Ser He Asp Val 
195 200 205 



Lys Thr Val Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly 
210 215 220 

He Glu He Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr 
225 230 235 240 



Phe Pro Gly Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Lys 
245 250 255 



Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys 
260 265 270 



Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val 
275 280 285 



Asp Phe Glu Ala Leu Gly Trp Asp Trp lie lie Ala Pro Lys Arg Tyr 
290 295 300 



Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys 
305 310 315 320 



Tyr Pro His Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala 
325 330 335 



Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu Tyr 
340 345 350 



Phe Asn Gly Lys Glu Gin He He Tyr Gly Lys He Pro Ala Met Val 
355 360 365 



Val Asp Arg Cys Gly Cys Ser 
370 375 



<210> 


11 


<211> 


1128 


<212> 


DNA 


<213> 


Bovine 


<220> 




<221> 


CDS 


<222> 


(1) . . (1125) 


<400> 


11 


atg caa aaa ctg caa 


Met Gin Lys Leu Gin 


1 


5 



gtt get ggc cca gtg gat ctg aat gag aac age gag cag aag gaa aat 
Val Ala Gly Pro Val Asp Leu Asn Glu Asn Ser Glu Gin Lys Glu Asn 



gtg gaa aaa gag ggg ctg tgt aat gca tgt ttg tgg agg gaa aac act 
Val Glu Lys Glu Gly Leu Cys Asn Ala Cys Leu Trp Arg Glu Asn Thr 



aca teg tea aga eta gaa gec ata aaa ate caa ate etc agt aaa ctt 192 
Thr Ser Ser Arg Leu Glu Ala He Lys He Gin He Leu Ser Lys Leu 



cgc ctg gaa aca get cct aac ate age aaa gat get ate aga caa ctt 
Arg Leu Glu Thr Ala Pro Asn lie Ser Lys Asp Ala lie Arg Gin Leu 



ttg ccc aag get cct cca etc ctg gaa ctg att gat cag ttc gat gtc 
Leu Pro Lys Ala Pro Pro Leu Leu Glu Leu He Asp Gin Phe Asp Val 



cag aga gat gee age agt gac ggc tec ttg gaa gac gat gac tac cac 
Gin Arg Asp Ala Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His 
100 105 110 

gec agg acg gaa acg gtc att ace atg ccc acg gag tct gat ctt eta 
Ala Arg Thr Glu Thr Val He Thr Met Pro Thr Glu Ser Asp Leu Leu 
115 120 125 

acg caa gtg gaa gga aaa ccc aaa tgt tgc ttc ttt aaa ttt age tct 
Thr Gin Val Glu Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser 
130 135 140 

aag ata caa tac aat aaa eta gta aag gec caa ctg tgg ata tat ctg 
Lys He Gin Tyr Asn Lys Leu Val Lys Ala Gin Leu Trp He Tyr Leu 
145 150 155 160 

agg cct gtc aag act cct gcg aca gtg ttt gtg caa ate ctg aga etc 
Arg Pro Val Lys Thr Pro Ala Thr Val Phe Val Gin He Leu Arg Leu 
165 170 175 

ate aaa ccc atg aaa gac ggt aca agg tat act gga ate cga tct ctg 
He Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly He Arg Ser Leu 
180 185 190 

aaa ctt gac atg aac cca ggc act ggt att tgg cag age att gat gtg 
Lys Leu Asp Met Asn Pro Gly Thr Gly He Trp Gin Ser He Asp Val 
195 200 205 

aag aca gtg ttg cag aac tgg etc aaa caa cct gaa tec aac tta ggc 
Lys Thr Val Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly 
210 215 220 

att gaa ate aaa get tta gat gag aat ggc cat gat ctt get gta ace 
He Glu He Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr 
225 230 235 240 

ttc cca gaa cca gga gaa gat gga ctg act ccc ttt tta gaa gtc aag 
Phe Pro Glu Pro Gly Glu Asp Gly Leu Thr Pro Phe Leu Glu Val Lys 
245 250 255 

gta aca gac aca cca aaa aga tct agg aga gat ttt ggg ctt gat tgt 
Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys 
260 265 270 

gat gaa cac tec aca gaa tct cga tgc tgt cgt tac cct eta act gtg 
Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val 
275 280 285 

gat ttt gaa get ttt gga tgg gat tgg att att gca cct aaa aga tat 
Asp Phe Glu Ala Phe Gly Trp Asp Trp He He Ala Pro Lys Arg Tyr 
290 295 300 



aag gcc aat tac tgc tct gga gaa tgt gaa ttt gta ttt ttg caa aag 
Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys 
305 310 315 320 

tat cct cat acc cat ctt gtg cac caa gca aac ccc aga ggt tea gcc 
Tyr Pro His Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala 
325 330 335 

ggc ccc tgc tgt act cct aca aag atg tct cca att aat atg eta tat 
Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu Tyr 
340 345 350 

ttt aat ggc gaa gga caa ata ata tac ggg aag att cca gcc atg gta 
Phe Asn Gly Glu Gly Gin lie lie Tyr Gly Lys lie Pro Ala Met Val 
355 360 365 

gta gat cgc tgt ggg tgt tea tga 
Val Asp Arg Cys Gly Cys Ser 
370 375 



<210> 12 

<211> 375 

<212> PRT 

<213> Bovine 

<400> 12 

Met Gin Lys Leu Gin lie Ser Val Tyr He Tyr Leu Phe Met Leu lie 
15 10 15 

Val Ala Gly Pro Val Asp Leu Asn Glu Asn Ser Glu Gin Lys Glu Asn 
20 25 30 

Val Glu Lys Glu Gly Leu Cys Asn Ala Cys Leu Trp Arg Glu Asn Thr 
35 40 45 

Thr Ser Ser Arg Leu Glu Ala He Lys He Gin He Leu Ser Lys Leu 
50 55 60 

Arg Leu Glu Thr Ala Pro Asn He Ser Lys Asp Ala He Arg Gin Leu 
65 70 75 80 

Leu Pro Lys Ala Pro Pro Leu Leu Glu Leu He Asp Gin Phe Asp Val 



Gin Arg Asp Ala Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His 
100 105 110 



Ala Arg Thr Glu Thr Val He Thr Met Pro Thr Glu Ser Asp Leu Leu 
115 120 125 



23 

Thr Gin Val Glu Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser 
130 135 140 



Lys lie Gin Tyr Asn Lys Leu Val Lys Ala Gin Leu Trp lie Tyr Leu 
145 150 155 160 



Arg Pro Val Lys Thr Pro Ala Thr Val Phe Val Gin lie Leu Arg Leu 
165 170 175 



lie Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly lie Arg Ser Leu 
180 185 190 



Lys Leu Asp Met Asn Pro Gly Thr Gly lie Trp Gin Ser lie Asp Val 
195 200 205 



Lys Thr Val Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly 
210 215 220 



lie Glu He Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr 
225 230 235 240 



Phe Pro Glu Pro Gly Glu Asp Gly Leu Thr Pro Phe Leu Glu Val Lys 
245 250 255 



Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys 
260 265 270 



Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val 
275 280 285 



Asp Phe Glu Ala Phe Gly Trp Asp Trp He He Ala Pro Lys Arg Tyr 
290 295 300 



Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys 
305 310 315 320 



Tyr Pro His Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala 
325 330 335 



Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro He Asn Met Leu Tyr 
340 345 350 



Phe Asn Gly Glu Gly Gin He He Tyr Gly Lys lie Pro Ala Met Val 
355 360 365 



Val Asp Arg Cys Gly Cys Ser 



<210> 13 

<211> 1128 

<212> DNA 

<213> Porcine 

<220> 

<221> CDS 

<222> (1) . . (1125) 



atg caa aaa ctg caa ate tat gtt tat att tac ctg ttt atg ctg att 
Met Gin Lys Leu Gin lie Tyr Val Tyr lie Tyr Leu Phe Met Leu lie 
15 10 15 

gtt get ggt ccc gtg gat ctg aat gag aac age gag caa aag gaa aat 
Val Ala Gly Pro Val Asp Leu Asn Glu Asn Ser Glu Gin Lys Glu Asn 



gtg gaa aaa gag ggg ctg tgt aat gca tgt atg tgg aga caa aac act 
Val Glu Lys Glu Gly Leu Cys Asn Ala Cys Met Trp Arg Gin Asn Thr 



aaa tct tea aga eta gaa gec ata aaa att caa ate etc agt aaa ctt 
Lys Ser Ser Arg Leu Glu Ala lie Lys lie Gin lie Leu Ser Lys Leu 



cgc ctg gaa aca get cct aac att age aaa gat get ata aga caa ctt 
Arg Leu Glu Thr Ala Pro Asn He Ser Lys Asp Ala He Arg Gin Leu 
65 70 75 80 

ttg ccc aaa get cct cca etc egg gaa ctg att gat cag tac gat gtc 
Leu Pro Lys Ala Pro Pro Leu Arg Glu Leu He Asp Gin Tyr Asp Val 



cag aga gat gac age agt gat ggc tec ttg gaa gat gat gat tat cac 
Gin Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His 
100 105 110 

get acg acg gaa acg ate att ace atg cct aca gag tct gat ctt eta 
Ala Thr Thr Glu Thr lie He Thr Met Pro Thr Glu Ser Asp Leu Leu 
115 120 125 

atg caa gtg gaa gga aaa ccc aaa tgc tgc ttc ttt aaa ttt age tct 
Met Gin Val Glu Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser 
130 135 140 

aaa ata caa tac aat aaa gta gta aag gee caa ctg tgg ata tat ctg 
Lys He Gin Tyr Asn Lys Val Val Lys Ala Gin Leu Trp He Tyr Leu 
145 150 155 160 

aga ccc gtc aag act cct aca aca gtg ttt gtg caa ate ctg aga etc 
Arg Pro Val Lys Thr Pro Thr Thr Val Phe Val Gin He Leu Arg Leu 
165 170 175 



ate aaa ccc atg aaa gac ggt aca agg tat act gga ate cga tct ctg 
He Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly He Arg Ser Leu 
180 185 190 



576 



25 

aaa ctt gac atg aac cca ggc act ggt att tgg cag age att gat gtg 

Lys Leu Asp Met Asn Pro Gly Thr Gly lie Trp Gin Ser lie Asp Val 

195 200 205 

aag aca gtg ttg caa aat tgg etc aaa caa cct gaa tec aac tta ggc 
Lys Thr Val Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly 

210 215 220 

att gaa ate aaa get tta gat gag aat ggt cat gat ctt get gta ace 
lie Glu lie Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr 

225 230 235 240 

ttc cca gga cca gga gaa gat ggg ctg aat ccc ttt tta gaa gtc aag 

Phe Pro Gly Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Lys 

245 250 255 

gta aca gac aca cca aaa aga tec agg aga gat ttt gga etc gac tgt 

Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys 

260 265 270 

gat gag cac tea aca gaa tct cga tgc tgt cgt tac cct eta act gtg 

Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val 

275 280 285 

gat ttt gaa get ttt gga tgg gac tgg att att gca ccc aaa aga tat 

Asp Phe Glu Ala Phe Gly Trp Asp Trp lie lie Ala Pro Lys Arg Tyr 

290 295 300 

aag gee aat tac tgc tct gga gag tgt gaa ttt gta ttt tta caa aaa 

Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys 

305 310 315 320 

tac cct cac act cat ctt gtg cac caa gca aac ccc aga ggt tea gca 

Tyr Pro His Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala 

325 330 335 

ggc ccc tgc tgt act ccc aca aag atg tct cca ate aat atg eta tat 

Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu Tyr 

340 345 350 

ttt aat ggc aaa gaa caa ata ata tat ggg aaa att cca gec atg gta 

Phe Asn Gly Lys Glu Gin He He Tyr Gly Lys He Pro Ala Met Val 

355 360 365 

gta gat cgc tgt ggg tgc tea tga 
Val Asp Arg Cys Gly Cys Ser 
370 375 



<210> 14 

<211> 375 

<212> PRT 

<213> Porcine 

<400> 14 

Met Gin Lys Leu Gin He Tyr Val Tyr He Tyr Leu Phe Met Leu He 



Val Ala Gly Pro Val Asp Leu Asn Glu Asn Ser Glu Gin Lys Glu Asn 



20 



26 

25 



30 



Val Glu Lys Glu Gly Leu Cys Asn Ala Cys Met Trp Arg Gin Asn Thr 



Lys Ser Ser Arg Leu Glu Ala lie Lys lie Gin lie Leu Ser Lys Leu 



Arg Leu Glu Thr Ala Pro Asn lie Ser Lys Asp Ala lie Arg Gin Leu 



Leu Pro Lys Ala Pro Pro Leu Arg Glu Leu lie Asp Gin Tyr Asp Val 



Gin Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His 
100 105 110 



Ala Thr Thr Glu Thr lie lie Thr Met Pro Thr Glu Ser Asp Leu Leu 
115 120 125 



Met Gin Val Glu Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser 
130 135 140 



Lys lie Gin Tyr Asn Lys Val Val Lys Ala Gin Leu Trp lie Tyr Leu 
145 150 155 160 



Arg Pro Val Lys Thr Pro Thr Thr Val Phe Val Gin lie Leu Arg Leu 
165 170 175 



lie Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly lie Arg Ser Leu 
180 185 190 



Lys Leu Asp Met Asn Pro Gly Thr Gly lie Trp Gin Ser lie Asp Val 
195 200 205 



Lys Thr Val Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly 
210 215 220 



He Glu He Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr 
225 230 235 240 



Phe Pro Gly Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Lys 
245 250 255 



Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys 



Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val 
275 280 285 



Asp Phe Glu Ala Phe Gly Trp Asp Trp He He Ala Pro Lys Arg Tyr 
290 295 300 



Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys 
305 310 315 320 



Tyr Pro His Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala 
325 330 335 



Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro He Asn Met Leu Tyr 
340 345 350 



Phe Asn Gly Lys Glu Gin He He Tyr Gly Lys He Pro Ala Met Val 
355 360 365 



Val Asp Arg Cys Gly Cys Ser 
370 375 



<210> 


15 


<211> 


1128 


<212> 


DNA 


<213> 


Ovine 


<220> 




<221> 


CDS 


<222> 


(1) . . 


<400> 


15 


atg caa 


aaa < 


Met Gin 


. Lys 1 


1 





gtt get ggc cca gtg gat ctg aat gag aac age gag cag aag gaa aat 96 
Val Ala Gly Pro Val Asp Leu Asn Glu Asn Ser Glu Gin Lys Glu Asn 
20 25 30 

gtg gaa aaa aag ggg ctg tgt aat gca tgc ttg tgg aga caa aac aat 144 
Val Glu Lys Lys Gly Leu Cys Asn Ala Cys Leu Trp Arg Gin Asn Asn 
35 40 45 

aaa tec tea aga eta gaa gee ata aaa ate caa ate etc agt aag ctt 192 
Lys Ser Ser Arg Leu Glu Ala He Lys He Gin He Leu Ser Lys Leu 
50 55 60 

cgc ctg gaa aca get cct aac ate age aaa gat get ata aga caa ctt 24 0 

Arg Leu Glu Thr Ala Pro Asn He Ser Lys Asp Ala He Arg Gin Leu 



ttg ccc aag get cct cca etc egg gaa ctg att gat cag tac gat gtc 2 88 



28 

Leu Pro Lys Ala Pro Pro Leu Arg Glu Leu lie Asp Gin Tyr Asp Val 
85 90 95 



cag aga gat gac age age gac ggc tec ttg gaa gac gat gac tac cac 
Gin Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His 
100 105 110 

gtt acg acg gaa acg gtc att acc atg ccc acg gag tct gat ctt eta 
Val Thr Thr Glu Thr Val He Thr Met Pro Thr Glu Ser Asp Leu Leu 
115 120 125 

gca gaa gtg caa gaa aaa ccc aaa tgt tgc ttc ttt aaa ttt age tct 
Ala Glu Val Gin Glu Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser 
130 135 140 

aag ata caa cac aat aaa gta gta aag gee caa ctg tgg ata tat ctg 
Lys He Gin His Asn Lys Val Val Lys Ala Gin Leu Trp He Tyr Leu 
145 150 155 160 

aga cct gtc aag act cct aca aca gtg ttt gtg caa ate ctg aga etc 
Arg Pro Val Lys Thr Pro Thr Thr Val Phe Val Gin He Leu Arg Leu 
165 170 175 

ate aaa ccc atg aaa gac ggt aca agg tat act gga ate cga tct ctg 
He Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly He Arg Ser Leu 
180 185 190 

aaa ctt gac atg aac cca ggc act ggt att tgg cag age att gat gtg 
Lys Leu Asp Met Asn Pro Gly Thr Gly He Trp Gin Ser He Asp Val 
195 200 205 

aag aca gtg ttg caa aac tgg etc aaa caa cct gaa tec aac tta ggc 
Lys Thr Val Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly 
210 215 220 

att gaa ate aaa get tta gat gag aat ggt cat gat ctt get gta acc 
lie Glu He Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr 
225 230 235 240 

ttc cca gaa cca gga gaa gaa gga ctg aat cct ttt tta gaa gtc aag 
Phe Pro Glu Pro Gly Glu Glu Gly Leu Asn Pro Phe Leu Glu Val Lys 
245 250 255 

gta aca gac aca cca aaa aga tct agg aga gat ttt ggg ctt gat tgt 
Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys 
260 265 270 

gat gag cac tec aca gaa tct cga tgc tgt cgt tac cct eta act gtg 
Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val 
275 280 285 

gat ttt gaa get ttt gga tgg gat tgg att att gca cct aaa aga tat 
Asp Phe Glu Ala Phe Gly Trp Asp Trp He He Ala Pro Lys Arg Tyr 
290 295 300 

aag gee aat tac tgc tct gga gaa tgt gaa ttt tta ttt ttg caa aag 
Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Leu Phe Leu Gin Lys 
305 310 315 320 

tat cct cat acc cat ctt gtg cac caa gca aac ccc aaa ggt tea gee 
Tyr Pro His Thr His Leu Val His Gin Ala Asn Pro Lys Gly Ser Ala 



29 

325 330 335 

ggc cct tgc tgt act cct aca aag atg tct cca att aat atg eta tat 
Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu Tyr 
340 345 350 

ttt aat ggc aaa gaa caa ata ata tat ggg aag att cca ggc atg gta 
Phe Asn Gly Lys Glu Gin lie lie Tyr Gly Lys lie Pro Gly Met Val 
355 360 365 

gta gat cgc tgt ggg tgc tea tga 
Val Asp Arg Cys Gly Cys Ser 
370 375 



<210> 16 

<211> 375 

<212> PRT 

<213> Ovine 

<400> 16 

Met Gin Lys Leu Gin lie Phe Val Tyr lie Tyr Leu Phe Met Leu Leu 
15 10 15 

Val Ala Gly Pro Val Asp Leu Asn Glu Asn Ser Glu Gin Lys Glu Asn 
20 25 30 

Val Glu Lys Lys Gly Leu Cys Asn Ala Cys Leu Trp Arg Gin Asn Asn 
35 40 45 

Lys Ser Ser Arg Leu Glu Ala lie Lys lie Gin lie Leu Ser Lys Leu 
50 55 60 

Arg Leu Glu Thr Ala Pro Asn He Ser Lys Asp Ala He Arg Gin Leu 
65 70 75 80 

Leu Pro Lys Ala Pro Pro Leu Arg Glu Leu He Asp Gin Tyr Asp Val 



Gin Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His 
100 105 110 



Val Thr Thr Glu Thr Val He Thr Met Pro Thr Glu Ser Asp Leu Leu 
115 120 125 



Ala Glu Val Gin Glu Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser 
130 135 140 



Lys He Gin His Asn Lys Val Val Lys Ala Gin Leu Trp He Tyr Leu 
145 150 155 160 



Arg Pro Val Lys Thr Pro Thr Thr Val Phe Val Gin lie Leu Arg Leu 
165 170 175 



lie Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly He Arg Ser Leu 
180 185 190 



Lys Leu Asp Met Asn Pro Gly Thr Gly He Trp Gin Ser He Asp Val 
195 200 205 



Lys Thr Val Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly 
210 215 220 



He Glu He Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr 
225 230 235 240 



Phe Pro Glu Pro Gly Glu Glu Gly Leu Asn Pro Phe Leu Glu Val Lys 
245 250 255 



Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys 
260 265 270 



Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val 
275 280 285 



Asp Phe Glu Ala Phe Gly Trp Asp Trp He He Ala Pro Lys Arg Tyr 
290 295 300 



Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Leu Phe Leu Gin Lys 
305 310 315 320 



Tyr Pro His Thr His Leu Val His Gin Ala Asn Pro Lys Gly Ser Ala 
325 330 335 



Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro He Asn Met Leu Tyr 
340 345 350 



Phe Asn Gly Lys Glu Gin He He Tyr Gly Lys He Pro Gly Met Val 
355 360 365 



Val Asp Arg Cys Gly Cys Ser 
370 375 



<210> 17 

<211> 1128 

<212> DNA 

<213> Meleagris gallopavo 



<220> 
<221> 
<222> 



CDS 

(1) . . (1125) 



, <400> 17 

atg caa aag eta gca gtc tat gtt tat att tac ctg ttc atg cag att 
Met Gin Lys Leu Ala Val Tyr Val Tyr lie Tyr Leu Phe Met Gin He 



tta gtt cat ccg gtg get ctt gat ggc agt agt cag ccc aca gag aac 
Leu Val His Pro Val Ala Leu Asp Gly Ser Ser Gin Pro Thr Glu Asn 



get gaa aaa gac gga ctg tgc aat get tgc acg tgg aga cag aat act 
Ala Glu Lys Asp Gly Leu Cys Asn Ala Cys Thr Trp Arg Gin Asn Thr 



aaa tec tec aga ata gaa gee ata aaa att caa ate etc age aaa ctg 
Lys Ser Ser Arg He Glu Ala He Lys He Gin He Leu Ser Lys Leu 



cgc ctg gaa caa gca cct aac att age agg gac gtt att aaa caa ctt 
Arg Leu Glu Gin Ala Pro Asn He Ser Arg Asp Val He Lys Gin Leu 



tta ccc aaa get cct ccg ctg cag gaa ctg att gat cag tat gac gtc 
Leu Pro Lys Ala Pro Pro Leu Gin Glu Leu He Asp Gin Tyr Asp Val 



cag aga gac gac agt age gat ggc tct ttg gaa gac gat gac tat cat 
Gin Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His 
100 105 110 

gec aca ace gaa acg att ate aca atg cct acg gag tct gat ttt ctt 
Ala Thr Thr Glu Thr He He Thr Met Pro Thr Glu Ser Asp Phe Leu 
115 120 125 

gta caa atg gag gga aaa cca aaa tgt tgc ttc ttt aag ttt age tct 
Val Gin Met Glu Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser 
130 135 140 

aaa ata caa tat aac aaa gta gta aag gca caa tta tgg ata tac ttg 
Lys He Gin Tyr Asn Lys Val Val Lys Ala Gin Leu Trp He Tyr Leu 
145 150 155 160 

agg caa gtc caa aaa cct aca acg gtg ttt gtg cag ate ctg aga etc 
Arg Gin Val Gin Lys Pro Thr Thr Val Phe Val Gin He Leu Arg Leu 
165 170 175 

att aaa ccc atg aaa gac ggt aca aga tat act gga att cga tct ttg 
He Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly He Arg Ser Leu 
180 185 190 

aaa ctt gac atg aac cca ggc act ggt ate tgg cag agt att gat gtg 
Lys Leu Asp Met Asn Pro Gly Thr Gly He Trp Gin Ser He Asp Val 
195 200 205 

aag aca gtg ttg caa aat tgg etc aaa cag cct gaa tec aat tta ggc 
Lys Thr Val Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly 
210 215 220 



32 



ate gaa ata aaa get ttt gat gag aat gga cga gat ctt get gta aca 72 0 

lie Glu lie Lys Ala Phe Asp Glu Asn Gly Arg Asp Leu Ala Val Thr 
225 230 235 240 

ttc cca gga cca ggt gaa gat gga ctg aac cca ttt tta gag gtc aga 768 
Phe Pro Gly Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Arg 
245 250 255 

gtt aca gac aca cca aaa egg tec cgc aga gat ttt ggc ctt gac tgc 816 
Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys 
260 265 270 

gac gag cac tea acg gaa tct cga tgt tgt cgc tac ccg ctg aca gtg 864 
Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val 
275 280 285 

gat ttt gaa get ttt gga tgg gac tgg att ata gca cct aaa aga tac 912 
Asp Phe Glu Ala Phe Gly Trp Asp Trp He He Ala Pro Lys Arg Tyr 
290 295 300 

aaa gec aat tac tgc tct gga gaa tgt gaa ttc gta ttt eta cag aaa 960 
Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys 
305 310 315 320 

tac ccg cac act cac ctg gta cac caa gca aat cca aga ggc tea gca 1008 
Tyr Pro His Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala 
325 330 335 

ggc cct tgc tgc aca ccc acc aag atg tec cct ata aac atg ctg tat 1056 
Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro He Asn Met Leu Tyr 
340 345 350 

ttc aat gga aaa gaa caa ata ata tat gga aag ata cca gec atg gtt 1104 
Phe Asn Gly Lys Glu Gin He He Tyr Gly Lys He Pro Ala Met Val 
355 360 365 

gta gat cgt tgc ggg tgc tea tga 112 8 

Val Asp Arg Cys Gly Cys Ser 
370 375 



<210> 18 
<211> 375 
<212> PRT 

<213> Meleagris gallopavo 
<400> 18 

Met Gin Lys Leu Ala Val Tyr Val Tyr He Tyr Leu Phe Met Gin He 
15 10 15 

Leu Val His Pro Val Ala Leu Asp Gly Ser Ser Gin Pro Thr Glu Asn 



Ala Glu Lys Asp Gly Leu Cys Asn Ala Cys Thr Trp Arg Gin Asn Thr 
35 40 45 



33 

Lys Ser Ser Arg lie Glu Ala lie Lys lie Gin lie Leu Ser Lys Leu 
50 55 60 



Arg Leu Glu Gin Ala Pro Asn lie Ser Arg Asp Val lie Lys Gin Leu 



Leu Pro Lys Ala Pro Pro Leu Gin Glu Leu lie Asp Gin Tyr Asp Val 



Gin Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His 
100 105 110 



Ala Thr Thr Glu Thr lie lie Thr Met Pro Thr Glu Ser Asp Phe Leu 
115 120 125 



Val Gin Met Glu Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser 
130 135 140 



Lys lie Gin Tyr Asn Lys Val Val Lys Ala Gin Leu Trp lie Tyr Leu 
145 150 155 160 



Arg Gin Val Gin Lys Pro Thr Thr Val Phe Val Gin lie Leu Arg Leu 
165 170 175 



lie Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly lie Arg Ser Leu 
180 185 190 



Lys Leu Asp Met Asn Pro Gly Thr Gly lie Trp Gin Ser lie Asp Val 
195 200 205 



Lys Thr Val Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly 
210 215 220 



lie Glu lie Lys Ala Phe Asp Glu Asn Gly Arg Asp Leu Ala Val Thr 
225 230 235 240 



Phe Pro Gly Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Arg 
245 250 255 



Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys 
260 265 270 



Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val 
275 280 285 



Asp Phe Glu Ala Phe Gly Trp Asp Trp lie lie Ala Pro Lys Arg Tyr 



Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys 
305 310 315 320 



Tyr Pro His Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala 
325 330 335 



Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu Tyr 
340 345 350 



Phe Asn Gly Lys Glu Gin lie lie Tyr Gly Lys lie Pro Ala Met Val 
355 360 365 



Val Asp Arg Cys Gly Cys Ser 
370 375 



<210> 19 

<211> 1125 

<212> DNA 

<213> Danio rerio 

<220> 

<221> CDS 

<222> (1) . . (1122) 

<400> 19 

atg cat ttt aca cag gtt tta att tct eta agt gta tta att gca tgt 4 8 

Met His Phe Thr Gin Val Leu lie Ser Leu Ser Val Leu lie Ala Cys 
15 10 15 

ggt cca gtg ggt tat gga gat ata acg gcg cac cag cag cct tec aca 96 
Gly Pro Val Gly Tyr Gly Asp lie Thr Ala His Gin Gin Pro Ser Thr 
20 25 30 

gee acg gag gaa age gag ctg tgt tec aca tgt gag ttc aga caa cac 144 
Ala Thr Glu Glu Ser Glu Leu Cys Ser Thr Cys Glu Phe Arg Gin His 
35 40 45 

age aag ctg atg aga ctg cat gec ate aag tec caa att ctt age aaa 192 
Ser Lys Leu Met Arg Leu His Ala lie Lys Ser Gin lie Leu Ser Lys 



etc cga etc aag cag get cca aac ate age egg gac gtg gtc aag cag 
Leu Arg Leu Lys Gin Ala Pro Asn lie Ser Arg Asp Val Val Lys Gin 



ctg tta ccc aaa gca ccg cct ttg caa caa ctt ctg gat cag tac gat 
Leu Leu Pro Lys Ala Pro Pro Leu Gin Gin Leu Leu Asp Gin Tyr Asp 



gtt tta gga gat gac agt aag gat gga get gtg gaa gag gac gat gaa 336 
Val Leu Gly Asp Asp Ser Lys Asp Gly Ala Val Glu Glu Asp Asp Glu 
100 105 110 
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cat gcc acc aca gag acc ate atg acc atg gec aca gaa cct gac ccc 
His Ala Thr Thr Glu Thr lie Met Thr Met Ala Thr Glu Pro Asp Pro 
115 120 125 

att gtt caa gta gat egg aaa ccg aag tgt tgc ttt ttc tec ttc agt 
lie Val Gin Val Asp Arg Lys Pro Lys Cys Cys Phe Phe Ser Phe Ser 
130 135 140 

ccg aag ate caa gcg aac egg ate gta aga gcg cag etc tgg gtt cat 
Pro Lys lie Gin Ala Asn Arg lie Val Arg Ala Gin Leu Trp Val His 
145 150 155 160 

ctg aga ccg gcg gag gag gcg acc acc gtc ttc tta cag ata tct egg 
Leu Arg Pro Ala Glu Glu Ala Thr Thr Val Phe Leu Gin lie Ser Arg 
165 170 175 

ctg atg ccc gtt aag gac gga gga aga cac cga ata cga tec ctg aaa 
Leu Met Pro Val Lys Asp Gly Gly Arg His Arg lie Arg Ser Leu Lys 
180 185 190 

ate gac gtg aac gca gga gtc acg tct tgg cag agt ata gac gta aag 
lie Asp Val Asn Ala Gly Val Thr Ser Trp Gin Ser lie Asp Val Lys 
195 200 205 

cag gtg etc acg gtg tgg tta aaa caa ccg gag acc aac cga ggc ate 
Gin Val Leu Thr Val Trp Leu Lys Gin Pro Glu Thr Asn Arg Gly lie 
210 215 220 

gag att aac gca tat gac gcg aag gga aac gac ttg gcc gtc act tea 
Glu lie Asn Ala Tyr Asp Ala Lys Gly Asn Asp Leu Ala Val Thr Ser 
225 230 235 240 

acc gag act ggg gag gat gga ctg etc ccc ttt atg gag gtg aaa ata 
Thr Glu Thr Gly Glu Asp Gly Leu Leu Pro Phe Met Glu Val Lys lie 
245 250 255 

tea gag ggc cca aaa cga ate egg agg gac tec gga ctg gac tgc gat 
Ser Glu Gly Pro Lys Arg lie Arg Arg Asp Ser Gly Leu Asp Cys Asp 
260 265 270 

gag aat tec tea gag tct cgc tgc tgc agg tac cct etc act gtg gac 
Glu Asn Ser Ser Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp 
275 280 285 

ttc gag gac ttt ggc tgg gac tgg att att get cca aaa cgc tat aag 
Phe Glu Asp Phe Gly Trp Asp Trp lie lie Ala Pro Lys Arg Tyr Lys 
290 295 300 

gcg aat tac tgt tea gga gaa tgc gac tac atg tac ctg cag aag tat 
Ala Asn Tyr Cys Ser Gly Glu Cys Asp Tyr Met Tyr Leu Gin Lys Tyr 
305 310 315 320 

ccc cac acc cat ctg gtg aac aag gcc agt ccg aga gga acg get ggg 
Pro His Thr His Leu Val Asn Lys Ala Ser Pro Arg Gly Thr Ala Gly 
325 330 335 

ccc tgc tgc act ccc acc aag atg tct ccc ate aac atg ctt tac ttt 
Pro Cys Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu Tyr Phe 
340 345 350 

aac ggc aaa gag cag ate ate tac ggc aag ate cct teg atg gta gta 
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Asn Gly Lys Glu Gin lie He Tyr Gly Lys He Pro Ser Met Val Val 
355 360 365 



gac cgc tgt ggc tgc tea tga 
Asp Arg Cys Gly Cys Ser 
370 



<210> 20 

<211> 374 

<212> PRT 

<213> Danio rerio 



<400> 20 

Met His Phe Thr Gin Val Leu He Ser Leu Ser Val Leu He Ala Cys 



Gly Pro Val Gly Tyr Gly Asp He Thr Ala His Gin Gin Pro Ser Thr 



Ala Thr Glu Glu Ser Glu Leu Cys Ser Thr Cys Glu Phe Arg Gin His 



Ser Lys Leu Met Arg Leu His Ala He Lys Ser Gin He Leu Ser Lys 



Leu Arg Leu Lys Gin Ala Pro Asn He Ser Arg Asp Val Val Lys Gin 



Leu Leu Pro Lys Ala Pro Pro Leu Gin Gin Leu Leu Asp Gin Tyr Asp 



Val Leu Gly Asp Asp Ser Lys Asp Gly Ala Val Glu Glu Asp Asp Glu 
100 105 110 



His Ala Thr Thr Glu Thr He Met Thr Met Ala Thr Glu Pro Asp Pro 
115 120 125 



He Val Gin Val Asp Arg Lys Pro Lys Cys Cys Phe Phe Ser Phe Ser 
130 135 140 



Pro Lys He Gin Ala Asn Arg He Val Arg Ala Gin Leu Trp Val His 
145 150 155 160 



Leu Arg Pro Ala Glu Glu Ala Thr Thr Val Phe Leu Gin He Ser Arg 
165 170 175 



Leu Met Pro Val Lys Asp Gly Gly Arg His Arg He Arg Ser Leu Lys 
180 185 190 
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He Asp Val Asn Ala Gly Val Thr Ser Trp Gin Ser He Asp Val Lys 
195 200 205 



Gin Val Leu Thr Val Trp Leu Lys Gin Pro Glu Thr Asn Arg Gly He 
210 215 220 



Glu He Asn Ala Tyr Asp Ala Lys Gly Asn Asp Leu Ala Val Thr Ser 
225 230 235 240 



Thr Glu Thr Gly Glu Asp Gly Leu Leu Pro Phe Met Glu Val Lys He 
245 250 255 



Ser Glu Gly Pro Lys Arg He Arg Arg Asp Ser Gly Leu Asp Cys Asp 
260 265 270 



Glu Asn Ser Ser Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp 
275 280 285 



Phe Glu Asp Phe Gly Trp Asp Trp He He Ala Pro Lys Arg Tyr Lys 
290 295 300 



Ala Asn Tyr Cys Ser Gly Glu Cys Asp Tyr Met Tyr Leu Gin Lys Tyr 
305 310 315 320 



Pro His Thr His Leu Val Asn Lys Ala Ser Pro Arg Gly Thr Ala Gly 
325 330 335 



Pro Cys Cys Thr Pro Thr Lys Met Ser Pro He Asn Met Leu Tyr Phe 
340 345 350 



Asn Gly Lys Glu Gin He He Tyr Gly Lys He Pro Ser Met Val Val 
355 360 365 



Asp Arg Cys Gly Cys Ser 
370 



<210> 21 

<211> 4 

<212> PRT 

<213> artificial 

<220> 

<223> proteolytic cleavage site 
<220> 

<221> VARIANT 

<222> (1) . . (4) 



<223> Xaa is any amino acid 

<400> 21 

Arg Xaa Xaa Arg 



<210> 22 

<211> 4 

<212> PRT 

<213> artificial 

<220> 

<223> proteolytic processing site 

<400> 22 

Arg, Ser Arg Arg 



<210> 23 

<211> 4 

<212> PRT 

<213> artificial 

<220> 

<223> proteolytic processing site 

<400> 23 

Arg lie Arg Arg 
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ata etc qcg gec ccg ctg ctg ctg ggc ttc ctg etc etc gec ctg gag 
?al Leu Ala Ala Pro Leu Leu Leu Gly Phe Leu Leu Leu Ala Leu Glu 
5 10 15 

etc, egg ccc egg ggg gag gcg gee gag ggc cce gcg gcg gcg gcg gcg 
lei Arg Pro Arg 111 llu Ala Ala Glu Gly Pro Ala Ala Ala Ala Ala 
20 25 30 

gcg gcg gcg gcg gcg gca gcg gcg ggg gtc ggg ggg gag cgc tec age 
Til All 111 111 Ala Ala Ala Ala Gly Val Gly Gly Glu Arg Ser Ser 

35 40 45 

egg cca gec ccg tec gtg gcg cec gag ccg gac ggc tgc ccc gtg tgc 
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Arg Pro Ala Pro Ser Val Ala Pro Glu Pro Asp Gly Cys Pro Val Cys 



gtt tgg egg cag cac age cgc gag ctg cgc eta gag age ate aag teg 
Val Trp Arg Gin His Ser Arg Glu Leu Arg Leu Glu Ser lie Lys Ser 



cag ate ttg age aaa ctg egg etc aag gag gcg ccc aac ate age cgc 
Gin lie Leu Ser Lys Leu Arg Leu Lys Glu Ala Pro Asn lie Ser Arg 



gag gtg gtg aag cag ctg ctg ccc aag gcg ccg ccg ctg cag cag ate 

Glu Val Val Lys Gin Leu Leu Pro Lys Ala Pro Pro Leu Gin Gin lie 

100 105 110 

ctg gac eta cac gac ttc cag ggc gac gcg ctg cag ccc gag gac ttc 

Leu Asp Leu His Asp Phe Gin Gly Asp Ala Leu Gin Pro Glu Asp Phe 

115 120 125 

ctg gag gag gac gag tac cac gee acc acc gag acc gtc att age atg 
Leu Glu Glu Asp Glu Tyr His Ala Thr Thr Glu Thr Val lie Ser Met 

130 135 140 145 

gec cag gag acg gac cca gca gta cag aca gat ggc age cct etc tgc 

Ala Gin Glu Thr Asp Pro Ala Val Gin Thr Asp Gly Ser Pro Leu Cys 

150 155 160 

tgc cat ttt cac ttc age ccc aag gtg atg ttc aca aag gta ctg aag 

Cys His Phe His Phe Ser Pro Lys Val Met Phe Thr Lys Val Leu Lys 

165 170 175 

gec cag ctg tgg gtg tac eta egg cct gta ccc cgc cca gee aca gtc 

Ala Gin Leu Trp Val Tyr Leu Arg Pro Val Pro Arg Pro Ala Thr Val 

180 185 190 

tac ctg cag ate ttg cga eta aaa ccc eta act ggg gaa ggg acc gca 

Tyr Leu Gin lie Leu Arg Leu Lys Pro Leu Thr Gly Glu Gly Thr Ala 

195 200 205 

999 99 a 999 99^ 99& 99<z egg cgt cac ate cgt ate cgc tea ctg aag 

Gly Gly Gly Gly Gly Gly Arg Arg His lie Arg lie Arg Ser Leu Lys 

210 215 220 225 

att gag ctg cac tea cgc tea ggc cat tgg cag age ate gac ttc aag 

lie Glu Leu His Ser Arg Ser Gly His Trp Gin Ser lie Asp Phe Lys 

230 235 240 

caa gtg eta cac age tgg ttc cgc cag cca cag age aac tgg ggc ate 

Gin Val Leu His Ser Trp Phe Arg Gin Pro Gin Ser Asn Trp Gly lie 

245 250 255 

gag ate aac gec ttt gat ccc agt ggc aca gac ctg get gtc acc tec 

Glu lie Asn Ala Phe Asp Pro Ser Gly Thr Asp Leu Ala Val Thr Ser 

260 265 270 

ctg ggg ccg gga gec gag ggg ctg cat cca ttc atg gag ctt cga gtc 

Leu Gly Pro Gly Ala Glu Gly Leu His Pro Phe Met Glu Leu Arg Val 

275 280 285 



eta gag aac aca aaa cgt tec egg egg aac ctg ggt ctg gac tgc gac 
Leu Glu Asn Thr Lys Arg Ser Arg Arg Asn Leu Gly Leu Asp Cys Asp 



968 
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290 295 300 305 

gag cac tea age gag tec cgc tgc tgc cga tat ccc etc aca gtg gac 1016 
Glu His Ser Ser Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp 
310 315 320 

ttt gag get ttc ggc tgg gac tgg ate ate gca cct aag cgc tac aag 10 64 

Phe Glu Ala Phe Gly Trp Asp Trp lie lie Ala Pro Lys Arg Tyr Lys 
325 330 335 

gee aac tac tgc tec ggc cag tgc gag tac atg ttc atg caa aaa tat 1112 
Ala Asn Tyr Cys Ser Gly Gin Cys Glu Tyr Met Phe Met Gin Lys Tyr 
340 345 350 

ccg cat ace cat ttg gtg cag cag gec aat cca aga ggc tct get ggg 1160 
Pro His Thr His Leu Val Gin Gin Ala Asn Pro Arg Gly Ser Ala Gly 
355 360 365 

ccc tgt tgt acc ccc acc aag atg tec cca ate aac atg etc tac ttc 1208 
Pro Cys Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu Tyr Phe 
370 375 380 385 

aat gac aag cag cag att ate tac ggc aag ate cct ggc atg gtg gtg 1256 
Asn Asp Lys Gin Gin lie lie Tyr Gly Lys He Pro Gly Met Val Val 
390 395 400 

gat cgc tgt ggc tgc tct taagtgggtc actacaagct gctggagcaa 1304 
Asp Arg Cys Gly Cys Ser 
405 

agacttggtg ggtgggtaac ttaacctctt cacagaggat aaaaaatget tgtgagtatg 1364 
acagaaggga ataaacaggc ttaaagggt 13 93 

<210> 25 

<211> 407 

<212> PRT 

<213> Homo sapiens 

<400> 25 

Met Val Leu Ala Ala Pro Leu Leu Leu Gly Phe Leu Leu Leu Ala Leu 



Glu Leu Arg Pro Arg Gly Glu Ala Ala Glu Gly Pro Ala Ala Ala Ala 



Ala Ala Ala Ala Ala Ala Ala Ala Ala Gly Val Gly Gly Glu Arg Ser 



Ser Arg Pro Ala Pro Ser Val Ala Pro Glu Pro Asp Gly Cys Pro Val 



Cys Val Trp Arg Gin His Ser Arg Glu Leu Arg Leu Glu Ser He Lys 
65 70 75 80 
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Ser Gin lie Leu Ser Lys Leu Arg Leu Lys Glu Ala Pro Asn lie Ser 



Arg Glu Val Val Lys Gin Leu Leu Pro Lys Ala Pro Pro Leu Gin Gin 
100 105 110 



He Leu Asp Leu His Asp Phe Gin Gly Asp Ala Leu Gin Pro Glu Asp 
115 120 125 



Phe Leu Glu Glu Asp Glu Tyr His Ala Thr Thr Glu Thr Val He Ser 
130 135 140 



Met Ala Gin Glu Thr Asp Pro Ala Val Gin Thr Asp Gly Ser Pro Leu 
145 150 155 160 



Cys Cys His Phe His Phe Ser Pro Lys Val Met Phe Thr Lys Val Leu 
165 170 175 



Lys Ala Gin Leu Trp Val Tyr Leu Arg Pro Val Pro Arg Pro Ala Thr 
180 185 190 



Val Tyr Leu Gin He Leu Arg Leu Lys Pro Leu Thr Gly Glu Gly Thr 
195 200 205 



Ala Gly Gly Gly Gly Gly Gly Arg Arg His He Arg He Arg Ser Leu 
210 215 220 



Lys He Glu Leu His Ser Arg Ser Gly His Trp Gin Ser He Asp Phe 
225 230 235 240 



Lys Gin Val Leu His Ser Trp Phe Arg Gin Pro Gin Ser Asn Trp Gly 
245 250 255 



He Glu lie Asn Ala Phe Asp Pro Ser Gly Thr Asp Leu Ala Val Thr 
260 265 270 



Ser Leu Gly Pro Gly Ala Glu Gly Leu His Pro Phe Met Glu Leu Arg 
275 280 285 



Val Leu Glu Asn Thr Lys Arg Ser Arg Arg Asn Leu Gly Leu Asp Cys 
290 295 300 



Asp Glu His Ser Ser Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val 
305 310 315 320 
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Asp Phe Glu Ala Phe Gly Trp Asp Trp lie lie Ala Pro Lys Arg Tyr 
325 330 335 



Lys Ala Asn Tyr Cys Ser Gly Gin Cys Glu Tyr Met Phe Met Gin Lys 
340 345 350 



Tyr Pro His Thr His Leu Val Gin Gin Ala Asn Pro Arg Gly Ser Ala 
355 360 365 



Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu Tyr 
370 375 380 



Phe Asn Asp Lys Gin Gin lie lie Tyr Gly Lys lie Pro Gly Met Val 
385 390 395 400 



Val Asp Arg Cys Gly Cys Ser 
405 



<210> 26 
<211> 476 



<212> DNA 

<213> Salmon 

<220> 

<221> CDS 

<222> (3) . . (473) 

<223> Salmon-1 



<400> 26 

gg cag ccg gag acg aat tgg ggg ate gag att aat gcg ttc gac teg 
Gin Pro Glu Thr Asn Trp Gly lie Glu lie Asn Ala Phe Asp Ser 



aa 9 S9 a a at gat ctg gcc gtt acc tea gca gaa gcg gga gaa gga ctg 
Lys Gly Asn Asp Leu Ala Val Thr Ser Ala Glu Ala Gly Glu Gly Leu 



caa ccc ttc atg gag gtg acg att tea gag ggc ccg aag cgc tec agg 
Gin Pro Phe Met Glu Val Thr lie Ser Glu Gly Pro Lys Arg Ser Arg 



aga gac teg ggc ctg gac tgt gac gag aac tec ccc gag tec cgc tgt 
Arg Asp Ser Gly Leu Asp Cys Asp Glu Asn Ser Pro Glu Ser Arg Cys 



tgc cgc tac ccc etc acg gta gac ttt gaa gac ttt ggc tgg gac tgg 
Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu Asp Phe Gly Trp Asp Trp 



att att gcc ccc aag cgc tac aag gcc aac tac tgc tct ggt gag tgt 
lie lie Ala Pro Lys Arg Tyr Lys Ala Asn Tyr Cys Ser Gly Glu Cys 



gag tac atg cac ctg cag aag tac ccc cac acc cac ctg gtg aac aag 335 



43 

Glu Tyr Met His Leu Gin Lys Tyr Pro His Thr His Leu Val Asn Lys 



get aac cct cgc ggc acc gca ggg ccc tgc tgc acc ccc acc aag atg 
Ala Asn Pro Arg Gly Thr Ala Gly Pro Cys Cys Thr Pro Thr Lys Met 
115 120 125 

tec ccc ate aac atg etc tac ttc aac cgc aaa gag cag ate ate tac 
Ser Pro lie Asn Met Leu Tyr Phe Asn Arg Lys Glu Gin lie lie Tyr 
130 135 140 

ggc aag ate ccc tec atg gtg gtg gac cgt tgc gga tgc teg tga 
Gly Lys lie Pro Ser Met Val Val Asp Arg Cys Gly Cys Ser 
145 150 155 



<210> 27 

<211> 157 

<212> PRT 

<213> Salmon 

<400> 27 

Gin Pro Glu Thr Asn Trp Gly lie Glu lie Asn Ala Phe Asp Ser Lys 
15 10 15 

Gly Asn Asp Leu Ala Val Thr Ser Ala Glu Ala Gly Glu Gly Leu Gin 
20 25 30 

Pro Phe Met Glu Val Thr lie Ser Glu Gly Pro Lys Arg Ser Arg Arg 
35 40 45 

Asp Ser Gly Leu Asp Cys Asp Glu Asn Ser Pro Glu Ser Arg Cys Cys 
50 55 60 

Arg Tyr Pro Leu Thr Val Asp Phe Glu Asp Phe Gly Trp Asp Trp lie 
65 70 75 80 

lie Ala Pro Lys Arg Tyr Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu 



Tyr Met His Leu Gin Lys Tyr Pro His Thr His Leu Val Asn Lys Ala 
100 105 110 



Asn Pro Arg Gly Thr Ala Gly Pro Cys Cys Thr Pro Thr Lys Met Ser 
115 120 125 



Pro lie Asn Met Leu Tyr Phe Asn Arg Lys Glu Gin lie lie Tyr Gly 
130 135 140 



Lys lie Pro Ser Met Val Val Asp Arg Cys Gly Cys Ser 
145 150 155 
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<210> 28 

<211> 412 

<212> DNA 

<213> Salmon 

<220> 

<221> CDS 

<222> (2) . . (409) 

<223> Salmon-2 



gtt acc tea act gaa gee gga gaa gga ctg caa ccc ttc atg gag gtg 
Val Thr Ser Thr Glu Ala Gly Glu Gly Leu Gin Pro Phe Met Glu Val 



aag att teg gag ggc ccg aag cgc tec agg aga gat teg ggc ctg gac 
Lys lie Ser Glu Gly Pro Lys Arg Ser Arg Arg Asp Ser Gly Leu Asp 



tgt gat gag aac tec ccc gag tec cgc tgc tgc egg tac ccc etc acg 
Cys Asp Glu Asn Ser Pro Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr 



gtg gac ttt gaa gac ttt ggc tgg gac tgg att att gee ccc aag cgc 
Val Asp Phe Glu Asp Phe Gly Trp Asp Trp lie lie Ala Pro Lys Arg 



tac aag gee aac tac tgc tct ggt gag tgc gag tac atg cac ctg cag 
Tyr Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Tyr Met His Leu Gin 



aag tac ccc cac acc cac ctg gtg aac aag get aac cct cgc ggc acc 
Lys Tyr Pro His Thr His Leu Val Asn Lys Ala Asn Pro Arg Gly Thr 



gcg ggg ccc tgc tgc acc ccc acc aag atg tec ccc ate aac atg etc 

Ala Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu 
100 105 110 

tac ttc aac cgc aaa gag cag ate ate tac ggc aag ate ccc tec atg 

Tyr Phe Asn Arg Lys Glu Gin lie lie Tyr Gly Lys lie Pro Ser Met 

115 120 125 



gtg gtg gac cgc tgc ggc tgc teg tga 
Val Val Asp Arg Cys Gly Cys Ser 
130 135 



<210> 29 

<211> 136 

<212> PRT 

<213> Salmon 



<400> 29 

Val Thr Ser Thr Glu Ala Gly Glu Gly Leu Gin Pro Phe Met Glu Val 
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Lys lie Ser Glu Gly Pro Lys Arg Ser Arg Arg Asp Ser Gly Leu Asp 



Cys Asp Glu Asn Ser Pro Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr 



Val Asp Phe Glu Asp Phe Gly Trp Asp Trp lie lie Ala Pro Lys Arg 



Tyr Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Tyr Met His Leu Gin 



Lys Tyr Pro His Thr His Leu Val Asn Lys Ala Asn Pro Arg Gly Thr 



Ala Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu 
100 105 110 



Tyr Phe Asn Arg Lys Glu Gin lie lie Tyr Gly Lys lie Pro Ser Met 
115 120 125 



Val Val Asp Arg Cys Gly Cys Ser 
130 135 



